IFN-ALPHA HOMOLOGUES

COPYRIGHT NOTIFICATION

Pursuant to 37 C.F.R. 1.71(e), a portion of this patent document contains
material which is subject to copyright protection. The copyright owner has no objection to
the facsimile reproduction by anyone of the patent document or the patent disclosure, as it
appears in the Patent and Trademark Office patent file or records, but otherwise reserves
all copyright rights whatsoever.

CROSS REFERENCE TO RELATED APPLICATIONS 

This application is a continuation-in-part application of and claims the
benefit of and priority to U.S. Patent Application Serial No. 09/145,483, filed October 7,
1999, the disclosure of which is incorporated herein by reference in its entirety for all
purposes.

FIELD OF THE INVENTION 

The present invention relates to the generation of new interferon-alpha
homologues.

BACKGROUND OF THE INVENTION 

Interferon-alphas are members of the diverse helical-bundle superfamily of
cytokine genes (Sprang, S.R. et al. (1993) Curr. Opin. Struct. Biol. 3:815-827). The
human interferon-alphas are encoded by a family of over 20 tandemly duplicated
nonallelic genes that share 85-98% sequence identity at the amino acid level (Henco, K. et
al. (1985) J. Mol. Biol. 185:227-260).

Interferon-alphas have been shown to inhibit various types of cellular
proliferation, and are especially useful for the treatment of a variety of cellular
proliferation disorders frequently associated with cancer, particularly hematologic
malignancies such as leukemias. These proteins have shown antiproliferative activity
against multiple myeloma, chronic lymphocytic leukemia, low-grade lymphoma, Kaposi's
sarcoma, chronic myelogenous leukemia, renal-cell carcinoma, urinary bladder tumors and
ovarian cancers (Bonnem, E.M. et al. (1984) J. Biol. Response Modifiers 3:580; Oldham,
R.K. (1985) Hospital Practice 20:71).
Interferon-alphas are also useful against various types of viral infections
(Finter, N.B. et al. (1991) Drugs 42(5):749). Interferon-alphas have shown activity
against human papillomavirus infection, Hepatitis B, and Hepatitis C infections (Finter,
N.B. et al., 1991, supra; Kashima, H. et al. (1988) Laryngoscope 98:334; Dusheiko, G.M.
et al. (1986) J. Hematology 3 (Suppl. 2):S199; Davis, G.L. et al. (1989) N. England J.
Med. 321:1501). The role of interferons and interferon receptors in the pathogenesis of
certain autoimmune and inflammatory diseases has also been investigated (Benoit, P. et al.
(1993) J. Immunol. 150(3):707).

Although these proteins possess therapeutic value in the treatment of a
number of diseases, they have not been optimized for use as pharmaceuticals. For
example, dose-limiting toxicity, receptor cross-reactivity, and short serum half-lives
significantly reduce the clinical utility of many of these cytokines (Dusheiko, G. (1997)
Hepatology 26:112S-121S; Vial, T. and Descotes, J. (1994) Drug Experience 10:115-150;
Funke, I. et al. (1994) Ann. Hematol. 68:49-52; Schomburg, A. et al. (1993) J.
Cancer Res. Clin. Oncol. 119:745-755). Diverse and severe side effect profiles which
accompany interferon administration include flu-like symptoms, fatigue, neurological
disorders including hallucination, fever, hepatic enzyme elevation, and leukopenia
(Pontzer, C.H. et al. (1991) Cancer Res. 51:5304; Oldham, 1985, supra).

The existence of abundant naturally occurring sequence diversity within the
interferon-alphas (and hence a large sequence space of recombinants) along with the
intricacy of interferon-alpha/receptor interactions and variety of therapeutic and
prophylactic activities creates an opportunity for the construction of superior interferon
homologues.

SUMMARY OF THE INVENTION 

The invention provides novel interferon-alpha (IFN-alpha or IFN-α)
homologue polypeptides, nucleic acids encoding the polypeptides and complementary
nucleotide sequences thereof, fragments of said polypeptides and nucleic acids, antibodies
to the polypeptides, and uses therefor, data sets containing character strings of interferon-
alpha homologue sequences, and automated systems for using the character strings.
In one aspect, the invention includes an isolated or recombinant interferon-
alpha nucleic acid homologue. Included are polynucleotide sequences selected from
SEQ ID NO:1 to SEQ ID NO:35, or from SEQ ID NO:72 to SEQ ID NO:78, and



WO 01/25438 



PCT/US00/27781



complementary polynucleotide sequences thereof. Polynucleotide sequences encoding a
polypeptide selected from SEQ ID NO:36 to SEQ ID NO:70 or from SEQ ID NO:79 to
SEQ ID NO:85, and complementary polynucleotide sequences thereof are also a feature of
the invention. Similarly, a polynucleotide sequence which hybridizes under highly
stringent conditions over substantially the entire length of any of the preceding
polynucleotide sequences is a feature of the present invention. In addition, a
polynucleotide sequence comprising a nucleotide fragment of any of the preceding
polynucleotide sequences which nucleotide fragment encodes a polypeptide having an
antiproliferative activity in a human Daudi cell line-based cell proliferation assay is a
feature of the invention. Similarly, a polynucleotide sequence comprising a nucleotide
fragment of any of the polynucleotide sequences of the invention described above and
below which encodes a polypeptide having antiviral activity in a murine cell line/EMCV-
based assay is a feature of the invention.

The invention also includes an isolated or recombinant nucleic acid,
comprising a polynucleotide sequence encoding a polypeptide, wherein the polypeptide
comprises the amino acid sequence:

CDLPQTHSLG-X11-X12-RA-X15-X16-LL-X19-QM-
X22-R-X24-S-X26-FSCLKDR-X34-DFG-X38-[...]-
X55-X56-X57-HE-X60-X61-QQTFN-[...]-
X85-X86-T-X88-L-X90-QQLN-[...]-
X114-D-X116-ILAV-X121-KY-X124-QRITLYL-X132-E-X134-KYSPC-X140-
WEVVRAEIMRSFSFSTNLQKRLRRKE,

or a conservatively substituted variation thereof, where X11 is N or D; X12 is R, S, or K;
X15 is L or M; X16 is I, M, or V; X19 is A or G; X22 is G or R; X24 is I or T; X26 is P or H;
X34 is H, Y or Q; X38 is F or L; X40 is Q or R; X45 is G or S; X46 is N or H; X47 is Q or R;
X50 is K or R; X51 is A or T; X55 is S or F; X56 is V or A; X57 is L or F; X60 is M or I;
X61 is I or M; X67 is L or F; X72 is D or N; X75 is A or V; X76 is A or T; X78 is E or D;
X79 is Q or E; X80 is S, R, T, or N; X83 is E or D; X85 is F or L; X86 is S or Y; X88 is E or G;
X90 is Y, H, or N; X95 is D, E, or N; X101 is I, M, or V; X103 is E or G; X105 is G or W;
X106 is V or M; X107 is E, G, or K; X108 is E or G; X114 is V, E, or G; X116 is S or P;
X121 is K or R; X124 is F or L; X132 is T, I, or M; X134 is K or R; and X140 is A or S.
Each of the single letters of this amino acid sequence represents a
particular amino acid residue according to standard practice known to those of ordinary
skill in the art.
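
As an illustration only, the degenerate formula above can be treated as a pattern against which a candidate sequence is checked. The following minimal Python sketch covers just the first 38 positions, assuming the fixed residues and variable positions read as CDLPQTHSLG-X11-X12-RA-X15-X16-LL-X19-QM-X22-R-X24-S-X26-FSCLKDR-X34-DFG-X38; the names `TEMPLATE`, `VARIABLE`, `build_pattern`, and `matches_n_terminal_motif` are hypothetical and are not part of the disclosure.

```python
import re

# Allowed residues at the variable positions of the N-terminal segment
# (positions 1-38), copied from the definitions in the text.
VARIABLE = {
    11: "ND", 12: "RSK", 15: "LM", 16: "IMV", 19: "AG",
    22: "GR", 24: "IT", 26: "PH", 34: "HYQ", 38: "FL",
}

# Fixed residues of the same segment; '.' marks a variable position.
# Assumption: this is how positions 1-38 of the formula read.
TEMPLATE = "CDLPQTHSLG..RA..LL.QM.R.S.FSCLKDR.DFG."


def build_pattern(template, variable):
    """Turn the template into a regular expression with one character class
    per variable position (positions are 1-based, as in the text)."""
    parts = []
    for position, residue in enumerate(template, start=1):
        if residue == ".":
            parts.append("[" + variable[position] + "]")
        else:
            parts.append(residue)
    return re.compile("".join(parts))


PATTERN = build_pattern(TEMPLATE, VARIABLE)


def matches_n_terminal_motif(sequence):
    """True if the first 38 residues of `sequence` satisfy the motif."""
    return PATTERN.match(sequence.upper()) is not None
```

A sequence beginning CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRHDFGF would satisfy this partial pattern, whereas replacing residue 11 with W would not. Conservatively substituted variations, which the text also contemplates, are not modeled by this sketch.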
A polypeptide having any of the preceding sequences, such as those
embodied in SEQ ID NO:36 to SEQ ID NO:54, is also a feature of the invention.

In other embodiments, the encoded polypeptide comprises an amino acid
sequence selected from the group consisting of SEQ ID NO:36 to SEQ ID NO:54; and the
nucleic acid comprises a polynucleotide sequence selected from the group consisting of
SEQ ID NO:1 to SEQ ID NO:19.

The invention also provides polypeptide fragments of any of SEQ ID NOS:36-70
and SEQ ID NOS:72-79. In one aspect of the invention, such a polypeptide fragment
exhibits an antiproliferative activity in a human Daudi cell line-based cell proliferation
assay or an antiviral activity in a murine cell line/EMCV-based assay, or both said
activities. The human Daudi cell line-based cell proliferation assay and antiviral activity
in a murine cell line/EMCV-based assay are described in greater detail below. In yet
another aspect, the invention provides a polynucleotide sequence comprising a nucleotide
fragment of any nucleic acid of the invention described above and below, wherein said
nucleotide fragment encodes a polypeptide fragment that exhibits an antiproliferative
activity in a human Daudi cell line-based cell proliferation assay or an antiviral activity in
a murine cell line/EMCV-based assay, or both activities, as is described in greater detail
below.

The invention also includes an isolated or recombinant nucleic acid
comprising a polynucleotide sequence encoding a polypeptide, wherein the polypeptide
comprises an amino acid sequence comprising at least 20 contiguous amino acids of any
one of SEQ ID NOS:36-70. In other embodiments, the polypeptide of the invention
comprises an amino acid sequence comprising one or more of amino acid residues (Tyr or
Gln)34, Gly37, Phe38, Lys71, Ala76, Tyr90, Ile132, Arg134, Phe152, Lys160, and
Glu166, wherein the numbering of the amino acid residues corresponds to the numbering
of residues in the amino acid sequence of SEQ ID NO:36. In various embodiments, the
encoded polypeptide of the invention comprises at least 30, at least 50, at least 70, at least
75, at least 100, at least 110, at least 120, at least 130, at least 140, at least 150, at least
155, at least 160, or at least 165 contiguous amino acid residues of any one of SEQ ID
NOS:36-70. In other embodiments, the encoded polypeptide is at least 150, at least 155,
at least 160, at least 163, or at least 165 amino acids in length. In another embodiment, the
encoded polypeptide is about 166 amino acids in length. In yet other embodiments, the
encoded polypeptide comprises an amino acid sequence selected from SEQ ID NO:36,
SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ
ID NO:45, and SEQ ID NO:46.
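
The "at least N contiguous amino acids" criterion used in this paragraph can be illustrated with a longest-common-substring check. The Python sketch below is a minimal example under that reading; the function names are hypothetical, and the reference string would be one of the sequences of the sequence listing, which is not reproduced here.

```python
def longest_shared_run(candidate: str, reference: str) -> int:
    """Length of the longest contiguous stretch common to both sequences."""
    best = 0
    # prev[j] holds the length of the common run ending at
    # candidate[i - 2] and reference[j - 1] (the previous row of the table).
    prev = [0] * (len(reference) + 1)
    for i in range(1, len(candidate) + 1):
        curr = [0] * (len(reference) + 1)
        for j in range(1, len(reference) + 1):
            if candidate[i - 1] == reference[j - 1]:
                curr[j] = prev[j - 1] + 1
                best = max(best, curr[j])
        prev = curr
    return best


def has_contiguous_residues(candidate: str, reference: str, minimum: int = 20) -> bool:
    """True if `candidate` contains at least `minimum` contiguous residues
    of `reference` (20 is the lowest threshold named in the text)."""
    return longest_shared_run(candidate, reference) >= minimum
```

The same helper can be called with `minimum` set to 30, 50, 70, and so on, to mirror the other thresholds recited above.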

In other embodiments, the invention provides a nucleic acid that comprises
a polynucleotide sequence selected from SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3,
SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:10, and SEQ ID
NO:11.

In other embodiments, the polypeptide encoded by any nucleic acid of the
invention described herein or a fragment thereof may have antiproliferative activity in a
human Daudi cell line-based assay, or antiviral activity in a human WISH cell/EMCV-
based assay. In other embodiments, the encoded polypeptide has antiproliferative activity
of at least about 8.3 x 10^6 units/milligram in the human Daudi cell line-based assay (1 unit
is the amount of protein in milligrams (mg) required to induce 50% antiproliferative
activity), or antiviral activity of at least about 2.1 x 10^7 units/milligram (mg) in the human
WISH cell/EMCV-based assay (1 unit is the amount of protein in mg required to induce
50% antiviral activity). In other embodiments, the encoded polypeptide can bind to a type
I interferon receptor, preferably a human type I interferon receptor, more preferably a
human (e.g., type I) interferon-alpha receptor.
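
Read literally, the unit definition above makes specific activity the reciprocal of the protein amount (in mg) that gives a 50% effect. The Python sketch below shows that arithmetic only; it assumes this literal reading, and the names `specific_activity_units_per_mg`, `DAUDI_THRESHOLD`, and `WISH_EMCV_THRESHOLD` are hypothetical. Actual assay protocols may normalize differently.

```python
# Threshold values taken from the text (units per milligram).
DAUDI_THRESHOLD = 8.3e6       # antiproliferative, human Daudi cell line-based assay
WISH_EMCV_THRESHOLD = 2.1e7   # antiviral, human WISH cell/EMCV-based assay


def specific_activity_units_per_mg(mg_for_half_maximal_effect: float) -> float:
    """Units per mg, where 1 unit is the amount of protein (mg) that
    induces a 50% effect. Assumes the literal reading of the definition."""
    if mg_for_half_maximal_effect <= 0:
        raise ValueError("protein amount must be positive")
    return 1.0 / mg_for_half_maximal_effect


def meets_threshold(mg_for_half_maximal_effect: float, threshold: float) -> bool:
    """True if the derived specific activity is at least `threshold`."""
    return specific_activity_units_per_mg(mg_for_half_maximal_effect) >= threshold
```

For example, a protein of which 1 x 10^-7 mg gives a 50% effect would have a specific activity of about 1 x 10^7 units/mg, which exceeds the Daudi threshold but not the WISH/EMCV threshold.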

The invention also includes a cell comprising any nucleic acid of the
invention described herein, or which expresses any polypeptide of the invention noted
herein. In one embodiment, the cell expresses a polypeptide encoded by the nucleic acid
of the invention as described herein.

The invention also includes a vector comprising any nucleic acid of the
invention described above and below. The vector can comprise a plasmid, a cosmid, a
phage, or a virus; the vector can be, e.g., an expression vector, a cloning vector, a
packaging vector, an integration vector, or the like. The invention also includes a cell
transduced by a vector of the invention. The invention also includes compositions
comprising any nucleic acid of the invention described above and below, and an excipient,
preferably a pharmaceutically acceptable excipient. Cells and transgenic animals which
include any polypeptide or nucleic acid of the invention described above and below, e.g.,
produced by transduction of a vector, are a feature of the invention.
The invention also includes compositions produced by digesting one or
more of the nucleic acids of the invention described above or below with a restriction
endonuclease, an RNAse, or a DNAse; and, compositions produced by incubating one or
more nucleic acids described above or below in the presence of deoxyribonucleotide
triphosphates and a nucleic acid polymerase, e.g., a thermostable polymerase.

The invention also includes compositions comprising two or more nucleic 
acids described above or below. The composition may comprise a library of nucleic acids, 
where the library contains at least about 5, 10, 20 or 50 nucleic acids. 

In another aspect, the invention includes an isolated or recombinant
polypeptide encoded by any nucleic acid described above or below. In one embodiment,
the polypeptide may comprise a sequence selected from SEQ ID NO:36 to SEQ ID
NO:70, or SEQ ID NO:79 to SEQ ID NO:85.

The invention also includes a polypeptide comprising at least 50 contiguous
amino acids of a protein encoded by a polynucleotide sequence, the polynucleotide
sequence selected from the group consisting of: (a) SEQ ID NO:1 to SEQ ID NO:35 or
SEQ ID NO:72 to SEQ ID NO:78; (b) a polynucleotide sequence that encodes a
polypeptide selected from SEQ ID NO:36 to SEQ ID NO:70 or SEQ ID NO:79 to SEQ ID
NO:85; and (c) a complementary sequence of a polynucleotide sequence which hybridizes
under highly stringent conditions over substantially the entire length of polynucleotide
sequence (a) or (b). In various embodiments, the polypeptide comprises at least about 70,
100, 120, 130, 140, 150, 155, 160, 165, or 166 contiguous amino acids of the encoded
protein.

The invention also includes an isolated or recombinant polypeptide
comprising an amino acid sequence comprising at least 50 contiguous amino acid residues
of any one of SEQ ID NOS:36-70, and one or more of amino acids Ala19, (Tyr or Gln)34,
Gly37, Phe38, Lys71, Ala76, Tyr90, Ile132, Arg134, Phe152, Lys160, and Glu166, where
the numbering of the amino acids corresponds to that of SEQ ID NO:36. In various
embodiments, the polypeptide comprises at least about 50, 70, 75, 100, 110, 120, 130, 140,
150, 155, 160, 163, 165, or 166 contiguous amino acids of any one of SEQ ID NOS:36-70.
In more preferred embodiments, the polypeptide comprises at least about 50, 70, 75, 100,
110, 120, 130, 140, 150, 155, 160, 163, 165, or 166 contiguous amino acid residues of any
one of SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41,
SEQ ID NO:42, SEQ ID NO:45, or SEQ ID NO:46. In other embodiments, the
polypeptide of the invention is at least about 50, 70, 75, 100, 110, 120, 130, 140, 150, 155,
160, 163, 165, or 166 amino acid residues in length, or is preferably 166 amino acids in
length. Longer polypeptides, e.g., which comprise purification tags or the like, are also
contemplated. Such polypeptides may display antiproliferative activities in a human Daudi
cell line-based assay and/or antiviral activities in a human WISH cell/EMCV-based assay.

The invention also includes a polypeptide which specifically binds
polyclonal antisera raised against at least one antigen, said at least one antigen comprising
a polypeptide sequence selected from an amino acid sequence set forth in SEQ ID NO:36
to SEQ ID NO:70 or SEQ ID NO:79 to SEQ ID NO:85 or a fragment thereof. In
particular, the invention provides polypeptides which bind polyclonal antisera raised
against at least one antigen, wherein said at least one antigen comprises at least one amino
acid sequence set forth in SEQ ID NO:36 to SEQ ID NO:70 or SEQ ID NO:79 to SEQ ID
NO:85, or a fragment of any of these amino acid sequences, wherein the polyclonal antisera are
subtracted with one or more known interferon-alpha polypeptides or proteins, including,
e.g., a polypeptide or protein encoded by a nucleic acid having or corresponding to one
or more of the following GenBank™ accession numbers: J00210 (alpha-D), J00207
(alpha-A), X02958 (alpha-6), X02956 (alpha-5), V00533 (alpha-H), V00542 (alpha-14),
V00545 (IFN-1B), X03125 (alpha-8), X02957 (alpha-16), V00540 (alpha-21), X02955
(alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961 (alpha-10 pseudogene), R0067
(Gx-1), I01614, I01787, I07821, M12350 (alpha-F), M38289, V00549 (alpha-2a), and
I08313 (alpha-Con1), and other similar or homologous interferon-alpha nucleic acid
sequences presented in GenBank.

Any polypeptide described above or below optionally has antiproliferative
activity in a human Daudi cell line-based assay and/or antiviral activity in a human
WISH cell/EMCV-based assay. Any polypeptide described above or below can have
antiproliferative activity of at least about 8.3 x 10^6 units/mg in the human Daudi cell line-
based assay or antiviral activity of at least about 2.1 x 10^7 units/mg in the human WISH
cell/EMCV-based assay. In other embodiments, any polypeptide described above or
below can bind to a type I interferon receptor, preferably a human type I interferon
receptor, more preferably a human interferon-alpha receptor.
In other embodiments, any polypeptide described above or below may
further include a secretion/localization sequence, e.g., a signal sequence, an organelle
targeting sequence, a membrane localization sequence, and the like. Any polypeptide
described herein may further include a sequence that facilitates purification, e.g., an
epitope tag (such as a FLAG epitope), a polyhistidine tag, a GST fusion, and the like.
The polypeptide optionally includes a methionine at the N-terminus. Any polypeptide of
the invention described herein optionally includes one or more modified amino acids,
such as a glycosylated amino acid, a PEG-ylated amino acid, a farnesylated amino acid, an
acetylated amino acid, a biotinylated amino acid, a carboxylated amino acid, a
phosphorylated amino acid, an acylated amino acid, or the like.

The invention also includes compositions comprising any polypeptide
described herein in an excipient, preferably a pharmaceutically acceptable excipient.

The invention also includes an antibody or antisera produced by
administering one or more of the polypeptides of the invention described herein to a
mammal, wherein the antibody or antisera does not specifically bind to a known alpha-
interferon polypeptide or protein, including, e.g., any polypeptide or protein encoded by a
nucleic acid having or corresponding to one or more of the following GenBank accession
numbers: J00210 (alpha-D), J00207 (alpha-A), X02958 (alpha-6), X02956 (alpha-5),
V00533 (alpha-H), V00542 (alpha-14), V00545 (IFN-1B), X03125 (alpha-8), X02957
(alpha-16), V00540 (alpha-21), X02955 (alpha-4b), V00532 (alpha-C), X02960 (alpha-7),
X02961 (alpha-10 pseudogene), R0067 (Gx-1), I01614, I01787, I07821, M12350 (alpha-F),
M38289, V00549 (alpha-2a), and I08313 (alpha-Con1), and other similar or
homologous interferon-alpha sequences presented in GenBank.

The invention also includes antibodies which specifically bind a
polypeptide comprising a sequence selected from SEQ ID NO:36 to SEQ ID NO:70 or
SEQ ID NO:79 to SEQ ID NO:85. The antibodies are, e.g., polyclonal, monoclonal,
chimeric, humanized, single chain, Fab fragments, fragments produced by an Fab
expression library, or the like.

Methods for producing the polypeptides of the invention are also included.
One such method comprises introducing into a population of cells any nucleic acid
described herein, operatively linked to a regulatory sequence effective to produce the
encoded polypeptide, culturing the cells in a culture medium to produce the polypeptide,
and optionally isolating the polypeptide from the cells or from the culture medium. The
nucleic acid may be part of a vector, such as a recombinant expression vector.

The invention also includes a method of inhibiting growth of tumor cells,
by contacting the tumor cells with a polypeptide of the invention described herein,
thereby inhibiting growth of the tumor cells. In one embodiment, the invention includes a
method of inhibiting growth of a population of tumor cells comprising contacting the
population of tumor cells with an effective amount of a polypeptide of the invention
sufficient to inhibit growth of tumor cells in said population of tumor cells, thereby
inhibiting growth of tumor cells in said population of cells. In various embodiments, the
tumor cells can be human carcinoma cells, human leukemia cells, human T-lymphoma
cells, human melanoma cells, other human cancer cells as described herein, and the like.
The tumor cells can be in vivo, ex vivo, or in vitro (e.g., cultured cells).

The invention also includes a method of inhibiting the replication of a virus
within one or more cells infected by the virus, by contacting one or more of the infected
cells with an effective amount of a polypeptide of the invention as described above and
below, wherein said amount is sufficient to inhibit viral replication in said one or more
infected cells, thereby inhibiting replication of the virus in the one or more cells. In
various embodiments, the virus can be an RNA virus, e.g., a human immunodeficiency
virus or a hepatitis C virus, or a DNA virus, e.g., a hepatitis B virus. The infected cells
can be in vivo, ex vivo, or in vitro (e.g., cultured cells).

The invention also includes a method of treating an autoimmune disorder
in a subject in need of such treatment, by administering to the subject an effective amount
of a polypeptide of the invention as described herein sufficient to treat the autoimmune
disorder. In various embodiments, the autoimmune disorder may be multiple sclerosis,
rheumatoid arthritis, lupus erythematosus, type I diabetes, and the like. The invention
also includes, in a method of treating a disorder treatable by administration of interferon-
alpha to a subject, an improvement comprising administering to the subject an effective
amount of a polypeptide of the invention as described herein sufficient to treat said
disorder. The disorder treatable by administration of interferon-alpha may be
multiple sclerosis, rheumatoid arthritis, lupus erythematosus, type I diabetes, AIDS or
AIDS-related complexes, or the like.
In general, nucleic acids and proteins derived by mutation of the sequences
herein are a feature of the invention. Similarly, those produced by diversity generation or
recursive sequence recombination (RSR) methods (e.g., DNA shuffling) are a feature of
the invention. Mutation and recombination methods using the nucleic acids described
herein are a feature of the invention. For example, one method of the invention includes
recursively recombining one or more nucleic acid sequences of the invention as described
above and below with one or more additional nucleic acids (including, but not limited to,
those noted herein), each sequence of the one or more additional nucleic acids encoding an
interferon-alpha homologue or an amino acid subsequence thereof. The recombining steps
are optionally performed in vivo, ex vivo, in silico or in vitro. Said recursive
recombination produces at least one library of recombinant interferon-alpha homologue
nucleic acids. Also included in the invention are a recombinant interferon-alpha
homologue nucleic acid produced by this method, a cell containing the recombinant
interferon-alpha homologue nucleic acid, a nucleic acid library produced by this recursive
recombination method, a composition comprising two or more of said recombinant
interferon-alpha nucleic acids, and a population of cells comprising such recombinant
interferon-alpha nucleic acids or containing the library. In one embodiment, the library
comprises at least ten such recombinant nucleic acids.

The invention also provides a method of producing a modified or
recombinant interferon-alpha homologue nucleic acid that comprises mutating a nucleic
acid of the invention as described herein.

Also provided are nucleic acids that encode an interferon-alpha homologue
having an increased growth inhibition activity, cytostatic activity, or cytotoxic activity
against a population of cells (e.g., cancer cells) relative to the growth inhibition activity,
cytostatic activity, or cytotoxic activity, respectively, of human interferon-alpha 2a or
other known interferon-alpha against the population of cells.

These and other objects and features of the invention will become more 
fully apparent when the following detailed description is read in conjunction with the 
accompanying figures. 






BRIEF DESCRIPTION OF THE FIGURES 

Figures 1A-1E show an alignment of exemplary mature interferon
homologue polypeptide sequences (SEQ ID NOS:36-70 and 79-85) according to the
invention.

Figure 2 shows antiproliferative activities in a human Daudi cell line-
based assay and antiviral activities in a human WISH cell/EMCV-based assay of,
respectively, exemplary interferon homologues of the present invention relative to the
respective antiproliferative and antiviral activities of two control compounds, human
interferon alpha-2a ("IFN-α-2a" or "2a") and consensus human interferon ("IFN-Con1" or
"Con1").

Figures 3A, 3B, and 3C illustrate activity profiles of IFN-alpha homologue
3DA11 (SEQ ID NO:40) and control interferons, human interferon alpha-2a ("2a") and
consensus human interferon alpha ("Con1"), against a panel of tumor cell lines. Fig. 3A
shows the cell total growth inhibitory activity of IFN-alpha homologue 3DA11 and each
control IFN on each respective cell line as reflected in the GI50 value, which is the
concentration (µg/ml) of interferon alpha homologue or control IFN alpha at which growth
of a particular cell line is inhibited by 50%, as measured by a 50% reduction in the net
protein/polypeptide increase in the interferon alpha homologue or control IFN alpha at the
end of the incubation period.

Fig. 3B shows the cytostatic activity of IFN-alpha homologue 3DA11 and
each control IFN on each cell line of the panel of cell lines. Cytostatic activity refers to an
activity capable of suppressing growth and multiplication of cells. Cytostatic activity is
assessed as a reflection of the concentration of IFN-alpha homologue 3DA11 or control
IFN (µg/ml) at which the growth and/or multiplication of cells of a particular cell line is
completely inhibited or suppressed, such that the amount of cellular protein at the end of
the incubation period equals the amount of cellular protein at the beginning of the
incubation period ("total growth inhibition" or "TGI").

Fig. 3C illustrates the cytotoxic activity of IFN-alpha homologue 3DA11
and each control IFN on each respective cell line. The cytotoxicity of an agent (e.g., an
IFN homologue or IFN compound) is the degree to which the agent possesses a specific
destructive action on certain cells or the possession of such action. The term typically
refers to an agent capable of causing cell death and is used particularly in referring to the
lysis of cells by immune phenomena and to agents or compounds that selectively kill
dividing cells. In Fig. 3C, cytotoxic activity is illustrated as LC50, the concentration of
IFN-alpha homologue 3DA11 (ng/ml) at which a 50% reduction in the net protein increase
in control cells (control IFN alpha) at the end of the incubation as compared to that at the
beginning of the incubation period is observed, indicating a net loss of cells following
addition of the particular interferon. Cytotoxic activity may be assessed as the
concentration of IFN-alpha homologue 3DA11 at which, relative to the control cells, 50%
of the total number of cells (i.e., total population) of a particular cell line are destroyed or
killed.
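
The three endpoints described for Figs. 3A-3C (GI50, TGI, and LC50) can be illustrated with a short calculation on cellular protein readings. The Python sketch below is hedged: it assumes a two-branch percent-growth convention of the kind commonly used in tumor cell line panel screens (growth relative to untreated controls when there is net growth, and relative to the starting protein when there is net loss), and it interpolates on a log concentration scale. Neither the convention nor the function names come from the text.

```python
import math
from typing import Optional, Sequence


def percent_growth(t_zero: float, control: float, treated: float) -> float:
    """Net growth of treated cells as a percentage (assumed convention).

    t_zero  -- cellular protein at the beginning of the incubation
    control -- protein of untreated control cells at the end
    treated -- protein of treated cells at the end
    """
    if treated >= t_zero:
        # Net growth: relative to the net growth of the untreated control.
        return 100.0 * (treated - t_zero) / (control - t_zero)
    # Net loss: relative to the protein present at the start.
    return 100.0 * (treated - t_zero) / t_zero


def crossing_concentration(
    concentrations: Sequence[float], growth: Sequence[float], level: float
) -> Optional[float]:
    """Interpolate, on a log10 concentration scale, where growth falls to `level`.

    `concentrations` must be positive and in ascending order.
    """
    points = list(zip(concentrations, growth))
    for (c_lo, g_lo), (c_hi, g_hi) in zip(points, points[1:]):
        if g_lo >= level >= g_hi and g_lo != g_hi:
            fraction = (g_lo - level) / (g_lo - g_hi)
            log_c = math.log10(c_lo) + fraction * (math.log10(c_hi) - math.log10(c_lo))
            return 10.0 ** log_c
    return None  # level not reached within the tested range


def summarize(t_zero, control, concentrations, treated):
    """GI50 (50% growth), TGI (0% growth) and LC50 (-50% growth)."""
    growth = [percent_growth(t_zero, control, t) for t in treated]
    return {
        "GI50": crossing_concentration(concentrations, growth, 50.0),
        "TGI": crossing_concentration(concentrations, growth, 0.0),
        "LC50": crossing_concentration(concentrations, growth, -50.0),
    }
```

Under this convention, TGI corresponds to the concentration at which end-of-incubation protein equals starting protein, and LC50 to a 50% net loss relative to the start, matching the verbal definitions above.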

Figs. 4A, 4B, 4C, and 4D show the cytostatic activity of selected
interferon-alpha homologues of the present invention relative to the cytostatic activities of
two control interferon alphas, human interferon-alpha 2a ("2a") and consensus human
interferon-alpha ("Con1"), against a leukemia cell line (RPMI-8226) (Fig. 4A), a lung
cancer cell line (NCI-H23) (Fig. 4B), a renal cancer cell line (ACHN) (Fig. 4C), and an
ovarian cancer cell line (OVCAR-3) (Fig. 4D), respectively. Cytostatic activity is
reflected by a TGI value for a particular interferon alpha (i.e., the concentration of
interferon alpha at which cell growth of a cell line is totally inhibited, wherein the amount
of cellular protein at the end of the incubation period equals the amount of cellular protein
at the beginning of the incubation period).

Fig. 5 presents a comparison of the number of mice (out of a total number
of six mice) that survived following administration of doses of 2 µg, 10 µg, and 50 µg of
two exemplary IFN-alpha homologues of the present invention (designated "IFN-CH2.2"
and "IFN-CH2.3"), doses of 2 µg, 10 µg, and 50 µg of murine IFN-alpha-4, and doses of 2
µg, 10 µg, and 50 µg of human IFN-alpha-2a, respectively. The results shown in Fig. 5
demonstrate that in a murine model system, the improved in vitro antiviral activity of these
two exemplary IFN-alpha homologues is maintained and sustained in vivo. Phosphate-
buffered saline (PBS) is used as a control.

DETAILED DESCRIPTION OF THE INVENTION 

DEFINITIONS 

Unless otherwise defined herein or below in the remainder of the
specification, all technical and scientific terms used herein have the same meaning as
commonly understood by those of ordinary skill in the art to which the present invention
belongs.
A "polynucleotide sequence" is a nucleic acid (which is a polymer of
nucleotides (A,C,T,U,G, etc. or naturally occurring or artificial nucleotide analogues)) or a
character string representing a nucleic acid, depending on context. Either the given
nucleic acid or the complementary nucleic acid can be determined from any specified
polynucleotide sequence.
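
Because a polynucleotide sequence may be a character string, the complementary strand can be derived mechanically from it. The Python sketch below illustrates this for the standard bases only; the function and table names are hypothetical, and nucleotide analogues or ambiguity codes are deliberately not handled.

```python
# Watson-Crick pairing for the standard bases.
DNA_PAIRS = {"A": "T", "C": "G", "G": "C", "T": "A"}
RNA_PAIRS = {"A": "U", "C": "G", "G": "C", "U": "A"}


def reverse_complement(sequence: str, rna: bool = False) -> str:
    """Return the complementary strand, written 5' to 3'."""
    pairs = RNA_PAIRS if rna else DNA_PAIRS
    try:
        # Reverse the string, then swap each base for its pairing partner.
        return "".join(pairs[base] for base in reversed(sequence.upper()))
    except KeyError as exc:
        raise ValueError(f"unsupported nucleotide symbol: {exc.args[0]}") from None
```

For instance, `reverse_complement("ATGC")` gives `"GCAT"`, and applying the function twice returns the original string.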

Similarly, an "amino acid sequence" is a polymer of amino acids (a protein, 
polypeptide, etc.) or a character string representing an amino acid polymer, depending on 
context. Either the given nucleic acid or the complementary nucleic acid can be 
determined from any specified polynucleotide sequence. 

A nucleic acid, protein, peptide, polypeptide, or other component is
"isolated" when it is partially or completely separated from components with which it is
normally associated (other peptides, polypeptides, proteins (including complexes, e.g.,
polymerases and ribosomes which may accompany a native sequence), nucleic acids,
cells, synthetic reagents, cellular contaminants, cellular components, etc.), e.g., such as
from other components with which it is normally associated in the cell from which it was
originally derived. A nucleic acid, polypeptide, or other component is isolated when it is
partially or completely recovered or separated from other components of its natural
environment such that it is the predominant species present in a composition, mixture, or
collection of components (i.e., on a molar basis it is more abundant than any other
individual species in the composition). In preferred embodiments, the preparation consists
of more than 70%, typically more than 80%, or preferably more than 90% of the isolated
species.

In one aspect, a "substantially pure" or "isolated" nucleic acid (e.g., RNA or DNA),
polypeptide, protein, or composition also means where the object species (e.g., nucleic
acid or polypeptide) comprises at least about 50, 60, or 70 percent by weight (on a
molar basis) of all macromolecular species present. A substantially pure or isolated
composition can also comprise at least about 80, 90, or 95 percent by weight of all
macromolecular species present in the composition. An isolated object species can also be
purified to essential homogeneity (contaminant species cannot be detected in the
composition by conventional detection methods) wherein the composition consists
essentially of derivatives of a single macromolecular species.

The term "isolated nucleic acid" may refer to a nucleic acid (e.g., DNA or RNA) that is
not immediately contiguous with both of the coding sequences with which it is immediately
contiguous (i.e., one at the 5' and one at the 3' end) in the naturally occurring genome
of the organism from which the nucleic acid of the invention is derived. Thus, this term
includes, e.g., a cDNA or a genomic DNA fragment produced by polymerase chain reaction
(PCR) or restriction endonuclease treatment, whether such cDNA or genomic DNA fragment is
incorporated into a vector, integrated into the genome of the same or a different species
than the organism, including, e.g., a virus, from which it was originally derived, linked
to an additional coding sequence to form a hybrid gene encoding a chimeric polypeptide, or
independent of any other DNA sequences. The DNA may be double-stranded or single-stranded,
sense or antisense.

A nucleic acid or polypeptide is "recombinant" when it is artificial or engineered, or
derived from an artificial or engineered protein or nucleic acid. The term "recombinant"
when used with reference, e.g., to a cell, nucleotide, vector, or polypeptide typically
indicates that the cell, nucleotide, or vector has been modified by the introduction of a
heterologous (or foreign) nucleic acid or the alteration of a native nucleic acid, or that
the polypeptide has been modified by the introduction of a heterologous amino acid, or
that the cell is derived from a cell so modified. Recombinant cells express nucleic acid
sequences (e.g., genes) that are not found in the native (non-recombinant) form of the
cell or express native nucleic acid sequences (e.g., genes) that would be abnormally
expressed, under-expressed, or not expressed at all. The term "recombinant nucleic acid"
(e.g., DNA or RNA) molecule means, for example, a nucleotide sequence that is not
naturally occurring or is made by the combination (for example, artificial combination) of
at least two segments of sequence that are not typically included together, not typically
associated with one another, or are otherwise typically separated from one another. A
recombinant nucleic acid can comprise a nucleic acid molecule formed by the joining
together or combination of nucleic acid segments from different sources and/or
artificially synthesized. The term "recombinantly produced" refers to an artificial
combination usually accomplished by either chemical synthesis means, recursive sequence
recombination of nucleic acid segments or other diversity generation methods (such as,
e.g., shuffling) of nucleotides, or manipulation of isolated segments of nucleic acids,
e.g., by genetic engineering techniques known to those of ordinary skill in the art.
"Recombinantly expressed" typically refers to techniques for the production of a
recombinant nucleic acid in vitro and transfer of the recombinant nucleic acid into cells
in vivo, in vitro, or ex vivo where it may be expressed or propagated. A "recombinant
polypeptide" or "recombinant protein" usually refers to a polypeptide or protein,
respectively, that results from a cloned or recombinant gene or nucleic acid.

A "subsequence" or "fragment" is any portion of an entire sequence, up to 
and including the complete sequence. 

Numbering of a given amino acid or nucleotide polymer "corresponds to numbering" of a
selected amino acid polymer or nucleic acid when the position of any given polymer
component (amino acid residue, incorporated nucleotide, etc.) is designated by reference
to the same residue position in the selected amino acid or nucleotide, rather than by the
actual position of the component in the given polymer.
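The idea of numbering one polymer by reference to another can be sketched in code. The example below assumes, purely for illustration, that the two sequences have already been aligned into equal-length strings with "-" marking gaps; the function name and the gap convention are not taken from this specification.

```python
from typing import List, Optional, Tuple


def reference_numbering(
    aligned_ref: str, aligned_query: str, gap: str = "-"
) -> List[Tuple[str, Optional[int]]]:
    """Pair each query residue with the 1-based position of the reference
    residue aligned to it, or None when the query residue is an insertion."""
    if len(aligned_ref) != len(aligned_query):
        raise ValueError("aligned sequences must have equal length")
    ref_pos = 0
    numbering = []
    for ref_char, query_char in zip(aligned_ref, aligned_query):
        if ref_char != gap:
            ref_pos += 1  # advance only on real reference residues
        if query_char != gap:
            numbering.append((query_char, ref_pos if ref_char != gap else None))
    return numbering


if __name__ == "__main__":
    # The query lacks the third reference residue, so its own third residue (P)
    # is designated position 4 by reference to the selected sequence.
    print(reference_numbering("CDLPQT", "CD-PQT"))
```

In this hypothetical alignment the residue P is the third residue of the query but is numbered 4, because the numbering follows the reference rather than the actual position in the query.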

A vector is a composition for facilitating cell transduction by a selected nucleic acid,
or expression of the nucleic acid in the cell. Vectors include, e.g., plasmids, cosmids,
viruses, YACs, bacteria, poly-lysine, etc. An "expression vector" is a nucleic acid
construct, generated recombinantly or synthetically, with a series of specific nucleic
acid elements that permit transcription of a particular nucleic acid in a host cell. The
expression vector can be part of a plasmid, virus, or nucleic acid fragment. The
expression vector typically includes a nucleic acid to be transcribed operably linked to a
promoter.

"Substantially an entire length of a polynucleotide or amino acid sequence" refers to at
least about 50%, at least about 60%, generally at least about 70%, generally at least
about 80%, or typically at least about 90%, 95%, 96%, 97%, 98%, or 99% or more of a
length of an amino acid sequence or nucleic acid sequence.

"A human alpha-interferon receptor" is a receptor which is naturally activated in human
cells by an alpha interferon.

"Naturally occurring" as applied to an object refers to the fact that the object can be
found in nature. For example, a polypeptide or polynucleotide sequence that is present in
an organism, including viruses, that can be isolated from a source in nature and which has
not been intentionally modified by man in the laboratory is naturally occurring. In one
aspect, a "naturally occurring" nucleic acid (e.g., DNA or RNA) molecule is a nucleic acid
molecule that exists in the same state as it exists in nature; that is, the nucleic acid
molecule is not isolated, recombinant, or cloned.

As used herein, an "antibody" refers to a protein comprising one or more polypeptides
substantially or partially encoded by immunoglobulin genes or fragments of immunoglobulin
genes. The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta,
epsilon and mu constant region genes, as well as myriad immunoglobulin variable region
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified
as gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes,
IgG, IgM, IgA, IgD and IgE, respectively. A typical immunoglobulin (e.g., antibody)
structural unit comprises a tetramer. Each tetramer is composed of two identical pairs of
polypeptide chains, each pair having one "light" (about 25 kD) and one "heavy" chain
(about 50-70 kD). The N-terminus of each chain defines a variable region of about 100 to
110 or more amino acids primarily responsible for antigen recognition. The terms variable
light chain (VL) and variable heavy chain (VH) refer to these light and heavy chains,
respectively. Antibodies exist as intact immunoglobulins or as a number of well
characterized fragments produced by digestion with various peptidases. Thus, for example,
pepsin digests an antibody below the disulfide linkages in the hinge region to produce
F(ab)'2, a dimer of Fab which itself is a light chain joined to VH-CH1 by a disulfide
bond. The F(ab)'2 may be reduced under mild conditions to break the disulfide linkage in
the hinge region thereby converting the F(ab)'2 dimer into an Fab' monomer. The Fab'
monomer is essentially an Fab with part of the hinge region (see Fundamental Immunology,
W.E. Paul, ed., Raven Press, N.Y. (1993), for a more detailed description of other
antibody fragments). While various antibody fragments are defined in terms of the
digestion of an intact antibody, one of skill will appreciate that such Fab' fragments
may be synthesized de novo either chemically or by utilizing recombinant DNA methodology.
Thus, the term antibody, as used herein, also includes antibody fragments either produced
by the modification of whole antibodies or synthesized de novo using recombinant DNA
methodologies. Antibodies include single chain antibodies, including single chain Fv (sFv)
antibodies in which a variable heavy and a variable light chain are joined together
(directly or through a peptide linker) to form a continuous polypeptide.

An "antigen-binding fragment" of an antibody is a peptide or polypeptide fragment of the
antibody which binds an antigen. An antigen-binding site is formed by those amino acids of
the antibody which contribute to, are involved in, or affect the binding of the antigen.
See Scott, T.A. and Mercer, E.I., Concise Encyclopedia: Biochemistry and Molecular
Biology (de Gruyter, 3d ed. 1997) [hereinafter "Scott, Concise Encyclopedia"] and Watson,
J.D. et al., Recombinant DNA (2d ed. 1992) [hereinafter "Watson, Recombinant DNA"], each
of which is incorporated herein by reference in its entirety for all purposes.

An "immunogen" refers to a substance that is capable of provoking an immune response.
Examples of immunogens include, e.g., antigens, autoantigens that play a role in
induction of autoimmune diseases, and tumor-associated antigens expressed on cancer cells.

An "antigen" is a substance that is capable of eliciting the formation of antibodies in a
host or generating a specific population of lymphocytes reactive with that substance.
Antigens are typically macromolecules (e.g., proteins and polysaccharides) that are
foreign to the host.

The term "immunoassay" includes an assay that uses an antibody or immunogen to bind or
specifically bind an antigen. The immunoassay is typically characterized by the use of
specific binding properties of a particular antibody to isolate, target, and/or quantify
the antigen.

The term "homology" generally refers to the degree of similarity between two or more
structures. The term "homologous sequences" refers to regions in macromolecules that have
a similar order of monomers. When used in relation to nucleic acid sequences, the term
"homology" refers to the degree of similarity between two or more nucleic acid sequences
(e.g., genes) or fragments thereof. Typically, the degree of similarity between two or
more nucleic acid sequences refers to the degree of similarity of the composition, order,
or arrangement of two or more nucleotide bases (or other genotypic feature) of the two or
more nucleic acid sequences. The term "homologous nucleic acids" generally refers to
nucleic acids comprising nucleotide sequences having a degree of similarity in nucleotide
base composition, arrangement, or order. The two or more nucleic acids may be of the same
or different species or group. The term "percent homology" when used in relation to
nucleic acid sequences, refers generally to a percent degree of similarity between the
nucleotide sequences of two or more nucleic acids.

When used in relation to polypeptide (or protein) sequences, the term "homology" refers
to the degree of similarity between two or more polypeptide (or protein) sequences (e.g.,
genes) or fragments thereof. Typically, the degree of similarity between two or more
polypeptide (or protein) sequences refers to the degree of similarity of the composition,
order, or arrangement of two or more amino acids of the two or more polypeptides (or
proteins). The two or more polypeptides (or proteins) may be of the same or different
species or group. The term "percent homology" when used in relation to polypeptide (or
protein) sequences, refers generally to a percent degree of similarity between the amino
acid sequences of two or more polypeptide (or protein) sequences. The term "homologous
polypeptides" or "homologous proteins" generally refers to polypeptides or proteins,
respectively, that have amino acid sequences and functions that are similar. Such
homologous polypeptides or proteins may be related by having amino acid sequences and
functions that are similar, but are derived or evolved from different or the same species
using the techniques described herein.
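The definitions above describe "percent homology" only as a percent degree of similarity and do not fix a particular calculation. As one simple stand-in, the sketch below computes percent identity over two sequences that are assumed to be already aligned to equal length, with "-" marking gaps. The function name, the gap symbol, and the choice to count a gap opposite a residue as a mismatch are all assumptions made for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of aligned columns holding identical residues.

    Columns where both sequences have a gap are ignored; a gap opposite a
    residue counts as a mismatch (an illustrative convention, not the only one).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Two hypothetical ten-residue segments differing at one position.
    print(percent_identity("CDLPQTHSLG", "CDLPETHSLG"))
```

Similarity measures that score conservative substitutions as partial matches would give different values; identity is used here only because it needs no scoring matrix.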

The term "subject" as used herein includes, but is not limited to, an organism; a mammal,
including, e.g., a human, non-human primate (e.g., monkey), mouse, pig, cow, goat, rabbit,
rat, guinea pig, hamster, horse, monkey, sheep, or other non-human mammal; a non-mammal,
including, e.g., a non-mammalian vertebrate, such as a bird (e.g., a chicken or duck) or a
fish; and a non-mammalian invertebrate.

The term "pharmaceutical composition" means a composition suitable for pharmaceutical use
in a subject, including an animal or human. A pharmaceutical composition generally
comprises an effective amount of an active agent and a pharmaceutically acceptable
carrier.

The term "effective amount" means a dosage or amount sufficient to produce a desired
result. The desired result may comprise an objective or subjective improvement in the
recipient of the dosage or amount.

A "prophylactic treatment" is a treatment administered to a subject who does not display
signs or symptoms of a disease, pathology, or medical disorder, or displays only early
signs or symptoms of a disease, pathology, or disorder, such that treatment is
administered for the purpose of diminishing, preventing, or decreasing the risk of
developing the disease, pathology, or medical disorder. A prophylactic treatment functions
as a preventative treatment against a disease or disorder. A "prophylactic activity" is an
activity of an agent, such as a nucleic acid, vector, gene, polypeptide, protein,
substance, or composition thereof that, when administered to a subject who does not
display signs or symptoms of pathology, disease or disorder, or who displays only early
signs or symptoms of pathology, disease, or disorder, diminishes, prevents, or decreases
the risk of the subject developing a pathology, disease, or disorder. A "prophylactically
useful" agent or compound (e.g., nucleic acid or polypeptide) refers to an agent or
compound that is useful in diminishing, preventing, treating, or decreasing development of
pathology, disease or disorder.

A "therapeutic treatment" is a treatment administered to a subject who displays symptoms
or signs of pathology, disease, or disorder, in which treatment is administered to the
subject for the purpose of diminishing or eliminating those signs or symptoms of
pathology, disease, or disorder. A "therapeutic activity" is an activity of an agent, such
as a nucleic acid, vector, gene, polypeptide, protein, substance, or composition thereof,
that eliminates or diminishes signs or symptoms of pathology, disease or disorder when
administered to a subject suffering from such signs or symptoms. A "therapeutically
useful" agent or compound (e.g., nucleic acid or polypeptide) indicates that an agent or
compound is useful in diminishing, treating, or eliminating such signs or symptoms of a
pathology, disease or disorder.

The term "gene" broadly refers to any segment of DNA associated with a biological
function. Genes include coding sequences and/or regulatory sequences required for their
expression. Genes also include non-expressed DNA nucleic acid segments that, e.g., form
recognition sequences for other proteins.

Generally, the nomenclature used hereafter and the laboratory procedures in cell culture,
molecular genetics, molecular biology, nucleic acid chemistry, and protein chemistry
described below are those well known and commonly employed by those of ordinary skill in
the art. Standard techniques, such as described in Sambrook et al., Molecular Cloning - A
Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor,
New York, 1989 (hereinafter "Sambrook") and Current Protocols in Molecular Biology, F.M.
Ausubel et al., eds., Current Protocols, a joint venture between Greene Publishing
Associates, Inc. and John Wiley & Sons, Inc. (supplemented through 1999) (hereinafter
"Ausubel"), are used for recombinant nucleic acid methods, nucleic acid synthesis, cell
culture methods, and transgene incorporation, e.g., electroporation, injection, and
lipofection. Generally, oligonucleotide synthesis and purification steps are performed
according to specifications. The techniques and procedures are generally performed
according to conventional methods in the art and various general references which are
provided throughout this document. The procedures therein are believed to be well known to
those of ordinary skill in the art and are provided for the convenience of the reader.

A variety of additional terms are defined or otherwise characterized herein. 

POLYNUCLEOTIDES OF THE INVENTION 

Interferon-alpha Homologue Sequences 
The invention provides isolated or recombinant interferon-alpha homologue polypeptides,
and isolated or recombinant polynucleotides encoding the polypeptides.

As described in more detail below, in accordance with the present invention,
polynucleotide sequences which encode novel interferon-alpha homologue polypeptides,
nucleotide sequences (e.g., subsequences) that encode fragments of interferon-alpha
homologue polypeptides, and nucleotide sequences that encode related fusion polypeptides
or proteins, or functional equivalents thereof, are collectively referred to herein as
"interferon-alpha homologues," "interferon homologue nucleic acids," "IFN-alpha
homologues," "IFN homologues," "IFN nucleic acids," "interferon homologues," "interferon
nucleic acids," "recombinant interferon-alpha," "recombinant interferon-alpha nucleic
acids," "nucleic acids of the invention," "polynucleotides of the invention," or
"nucleotides of the invention." Polynucleotide, nucleotide, and nucleic acid fragments of
each of the preceding terms are also intended to be included and encompassed in
polynucleotides, nucleotides, and nucleic acids of the invention. The term "nucleic acid"
is used interchangeably with the term "nucleotide."

Polynucleotides encoding the polypeptides of the invention were discovered in libraries
of shuffled interferon-alpha related sequences. The library members were screened for
antiproliferative activity against human tumor cell lines and, in some cases, assayed for
antiviral activity against virus-infected human cells. A subset of the sequences provided
herein were discovered in shuffled libraries screened for antiviral activity against
virus-infected mouse cells. Coding sequences for interferon homologues were identified as
described in the examples.

Briefly, libraries of shuffled mature interferon-alpha coding sequences were introduced
into E. coli. Colonies were screened in a high-throughput antiproliferative activity assay
against a human Daudi tumor cell line as described in Example 1, and colonies expressing
active polypeptides were selected, re-screened, and expression levels determined. DNA from
selected colonies was isolated and re-shuffled to create secondary libraries. The
secondary libraries were introduced into E. coli and screened for antiproliferative
activity in the human Daudi cell line-based cell proliferation assay. DNA from colonies
selected from the primary and secondary library screens was transduced into Chinese
hamster ovary (CHO) cells, and stable cell lines were generated. CHO-expressed proteins
were purified, quantitated, and assayed for antiproliferative activity using the human
Daudi cell line, and optionally, for antiviral activity using encephalomyocarditis virus
(EMCV)-infected human WISH cells, as described in Example 1. Exemplary shuffled nucleic
acids which encode interferon-alpha homologue polypeptides having antiproliferative
activity in the human Daudi cell line-based assay are identified herein as SEQ ID NO:1 to
SEQ ID NO:35, which encode mature interferon-alpha homologue polypeptides identified
herein as SEQ ID NO:36 to SEQ ID NO:70, respectively. Libraries of shuffled mature
interferon-alpha coding sequences were also screened in a high-throughput antiviral
activity screen against EMCV-infected mouse cells. Exemplary shuffled nucleic acids which
encode polypeptides having antiviral activity in the murine cell/EMCV-based assay are
identified herein as SEQ ID NO:72 to SEQ ID NO:78, which encode mature interferon
homologue polypeptides identified herein as SEQ ID NO:79 to SEQ ID NO:85.

In another aspect, the invention provides an isolated or recombinant nucleic acid that
comprises a polynucleotide sequence selected from the group of: (a) SEQ ID NO:1 to SEQ ID
NO:35, or a complementary polynucleotide sequence thereof; (b) a polynucleotide sequence
encoding a polypeptide selected from SEQ ID NO:36 to SEQ ID NO:71, or a complementary
polynucleotide sequence thereof; (c) a polynucleotide sequence which hybridizes under at
least stringent or at least highly stringent hybridization conditions (or ultra-high
stringent or ultra-ultra-high stringent hybridization conditions) over substantially the
entire length of polynucleotide sequence (a) or (b), or with a 50, 120, 130, 140, 145,
150, 155, 160, or 165 nucleotide base subsequence or fragment of a polynucleotide sequence
of (a) or (b); and (d) a polynucleotide sequence comprising a fragment of (a), (b), or
(c), which fragment encodes all or a part of a polypeptide having an antiproliferative
activity in a human Daudi cell line-based assay or an antiviral activity in an assay known
in the art for measuring antiviral activity.

In another aspect, the invention provides an isolated or recombinant nucleic acid that
comprises a polynucleotide sequence selected from the group of: (a) SEQ ID NO:72 to SEQ ID
NO:78, or a complementary polynucleotide sequence thereof; (b) a polynucleotide sequence
encoding a polypeptide selected from SEQ ID NO:79 to SEQ ID NO:85, or a complementary
polynucleotide sequence thereof; (c) a polynucleotide sequence which hybridizes under at
least stringent or at least highly stringent hybridization conditions (or ultra-high
stringent or ultra-ultra-high stringent hybridization conditions) over substantially the
entire length of polynucleotide sequence (a) or (b), or with a 50, 120, 130, 140, 145,
150, 155, 160, or 165 nucleotide base subsequence or fragment of a polynucleotide sequence
of (a) or (b); and (d) a polynucleotide sequence comprising a fragment of (a), (b), or
(c), which fragment encodes all or a part of a polypeptide having an antiproliferative
activity in a human Daudi cell line-based assay or an antiviral activity in a murine cell
line/EMCV-based assay.

The present invention also includes a mature interferon-alpha homologue polypeptide
comprising the amino acid sequence identified herein as SEQ ID NO:71 and a polynucleotide
sequence encoding said polypeptide or a fragment of said polypeptide having an
antiproliferative activity in the human Daudi cell line-based assay and/or an antiviral
activity in the murine cell/EMCV-based assay.

The invention also includes an isolated or recombinant nucleic acid comprising a
polynucleotide sequence encoding a polypeptide, wherein the polypeptide comprises the
amino acid sequence:

CDLPQTHSLG-X11-X12-RA-X15-X16-LL-X19-QM-X22-R-X24-S-X26-FSCLKDR-X34-DFG[...]
X55-X56-X57-HE-X60-X61-QQTFN-X67[...]
X85-X86-T-X88-L-X90-[...]
X114-D-X116-ILAV-X121-KY-X124-QRITLYL-X132-E-X134-KYSPC-X140-
WEVVRAEIMRSFSFSTNLQKRLRRKE,

or a conservatively substituted variation thereof, where X11 is N or D; X12 is R, S, or K;
X15 is L or M; X16 is I, M, or V; X19 is A or G; X22 is G or R; X24 is I or T; X26 is P
or H; X34 is H, Y or Q; X38 is F or L; X40 is Q or R; X45 is G or S; X46 is N or H; X47
is Q or R; X50 is K or R; X51 is A or T; X55 is S or F; X56 is V or A; X57 is L or F; X60
is M or I; X61 is I or M; X67 is L or F; X72 is D or N; X75 is A or V; X76 is A or T; X78
is E or D; X79 is Q or E; X80 is S, R, T, or N; X83 is E or D; X85 is F or L; X86 is S or
Y; X88 is E or G; X90 is Y, H, or N; X95 is D, E, or N; X101 is I, M, or V; X103 is E or
G; X105 is G or W; X106 is V or M; X107 is E, G, or K; X108 is E or G; X114 is V, E, or G;
X116 is S or P; X121 is K or R; X124 is F or L; X132 is T, I, or M; X134 is K or R; and
X140 is A or S. Each of the single letters of this amino acid sequence represents a
particular amino acid residue according to standard practice known to those of ordinary
skill in the art. Such polypeptides have an antiproliferative activity in the human Daudi
cell line-based assay (e.g., at least about 8.3x10^6 units/mg) and/or an antiviral
activity in a human WISH cell/EMCV-based assay (at least about 2.1x10^7 units/mg).
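A sequence definition of this kind, with fixed residues and a short list of permitted residues at each variable position, can be checked mechanically. The sketch below covers only positions 1 to 37 (CDLPQTHSLG through DFG), using the alternatives listed above for X11 through X34. The function name and the test sequences are hypothetical and are not SEQ ID entries of this document; conservatively substituted variations are not modelled.

```python
import re

# Positions 1-37 only; each bracketed class lists the residues permitted at
# the corresponding variable position in the definition above.
N_TERMINAL_PATTERN = re.compile(
    "CDLPQTHSLG"  # positions 1-10 (fixed)
    "[ND]"        # X11
    "[RSK]"       # X12
    "RA"          # positions 13-14 (fixed)
    "[LM]"        # X15
    "[IMV]"       # X16
    "LL"          # positions 17-18 (fixed)
    "[AG]"        # X19
    "QM"          # positions 20-21 (fixed)
    "[GR]"        # X22
    "R"           # position 23 (fixed)
    "[IT]"        # X24
    "S"           # position 25 (fixed)
    "[PH]"        # X26
    "FSCLKDR"     # positions 27-33 (fixed)
    "[HYQ]"       # X34
    "DFG"         # positions 35-37 (fixed)
)


def matches_n_terminal_definition(sequence: str) -> bool:
    """True if the first 37 residues satisfy the position constraints."""
    return N_TERMINAL_PATTERN.match(sequence) is not None


if __name__ == "__main__":
    # Hypothetical test sequences, written out only to exercise the pattern.
    print(matches_n_terminal_definition("CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRHDFG"))
    print(matches_n_terminal_definition("CDLPQTHSLGARRALILLAQMGRISPFSCLKDRHDFG"))
```

The second test sequence carries A at position 11, which is not among the residues permitted for X11, so it is rejected.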

As described in greater detail below, the polynucleotides of the invention are useful for
a variety of applications, including, but not limited to, as therapeutic and prophylactic
agents in methods of in vivo and ex vivo treatment of a variety of diseases, disorders,
and conditions in a variety of subjects; for use in in vitro methods, such as diagnostic
methods, to detect, diagnose, and treat a variety of diseases, disorders, and conditions
in a variety of subjects; for use in, e.g., gene therapy; as therapeutics and
prophylactics, e.g., for use in methods of therapeutic and prophylactic treatment of a
disease, disorder or condition; as immunogens; for use in diagnostic and screening assays;
and as diagnostic probes for the presence of complementary or partially complementary
nucleic acids (including for detection of IFN-alpha coding nucleic acids).

Making Polynucleotides of the Invention 

Polynucleotides and oligonucleotides of the invention can be prepared by standard
solid-phase methods, according to known synthetic methods. Typically, fragments of up to
about 20, 30, 40, 50, 60, 70, 80, 90, and/or 100 nucleotide bases are individually
synthesized, then joined (e.g., by enzymatic or chemical ligation methods, or polymerase
mediated recombination methods) to form essentially any desired continuous sequence. In
another aspect, nucleotide fragments of greater than 100 nucleotide bases (e.g., 150, 180,
200, 210, 240, 270, 300, 330, 360, 390, 400, 420, 450, 465, 474, 470, 475, 489, 490, 495,
496 bases) are individually synthesized, then joined (e.g., by enzymatic or chemical
ligation methods, or polymerase mediated recombination methods) to form essentially any
desired continuous sequence. For example, the polynucleotides and oligonucleotides of the
invention, including fragments thereof (and those as described herein), can be prepared by
chemical synthesis using, e.g., the classical phosphoramidite method described by Beaucage
et al. (1981) Tetrahedron Letters 22:1859-69, or the method described by Matthes et al.
(1984) EMBO J. 3:801-05, e.g., as is typically practiced in automated synthetic methods.
According to the phosphoramidite method, oligonucleotides are synthesized, e.g., in an
automatic DNA synthesizer, purified, annealed, ligated and cloned in appropriate vectors.
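The bookkeeping behind synthesizing a long sequence as shorter fragments and then joining them can be sketched as follows. This is only an illustration of the partitioning step: the function name and the 100-base default are assumptions for the example, and the overlaps or complementary ends that a real ligation or polymerase-mediated assembly would require are not modelled.

```python
from typing import List


def split_into_fragments(sequence: str, max_len: int = 100) -> List[str]:
    """Partition `sequence` into consecutive fragments of at most `max_len` bases."""
    if max_len < 1:
        raise ValueError("max_len must be at least 1")
    return [sequence[i:i + max_len] for i in range(0, len(sequence), max_len)]


if __name__ == "__main__":
    # A placeholder 496-base string, not a sequence from this document.
    target = "ACGT" * 124
    fragments = split_into_fragments(target)
    print([len(f) for f in fragments])
    # Joining the fragments in order restores the continuous sequence.
    print("".join(fragments) == target)
```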

In addition, essentially any nucleic acid can be custom ordered from any of a variety of
commercial sources, such as The Midland Certified Reagent Company (mcrc@oligos.com), The
Great American Gene Company (http://www.genco.com), ExpressGen Inc.
(www.expressgen.com), Operon Technologies Inc. (Alameda, CA) and many others. Similarly,
peptides and antibodies can be custom ordered from any of a variety of sources, such as
PeptidoGenic (pkim@ccnet.com), HTI Bio-products, Inc. (http://www.htibio.com), BMA
Biomedicals Ltd. (U.K.), Bio.Synthesis, Inc., and many others.

Certain polynucleotides of the invention may also be obtained by screening cDNA libraries
(e.g., libraries generated by recombining homologous nucleic acids as in typical diversity
generation methods, such as, e.g., shuffling methods) using oligonucleotide probes which
can hybridize to or PCR-amplify polynucleotides which encode the interferon homologue
polypeptides and fragments of those polypeptides. Procedures for screening and isolating
cDNA clones are well-known to those of skill in the art. Such techniques are described in,
for example, Sambrook et al., Molecular Cloning - A Laboratory Manual (2nd Ed.), Vols.
1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 1989 (hereinafter
"Sambrook") and Current Protocols in Molecular Biology, F.M. Ausubel et al., eds., Current
Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley &
Sons, Inc. (supplemented through 1999) (hereinafter "Ausubel").

As described in more detail herein, the polynucleotides of the invention include
sequences which encode novel mature interferon-alpha homologues and sequences
complementary to the coding sequences, and novel fragments of such coding sequences and
complements thereof. The polynucleotides can be in the form of RNA or in the form of DNA,
and include mRNA, cRNA, synthetic RNA and DNA, and cDNA. The polynucleotides can be
double-stranded or single-stranded, and if single-stranded, can be the coding strand or
the non-coding (anti-sense, complementary) strand. The polynucleotides optionally include
the coding sequence of an interferon-alpha homologue (i) in isolation, (ii) in combination
with additional coding sequence, so as to encode, e.g., a fusion protein, a pre-protein, a
prepro-protein, or the like, (iii) in combination with non-coding sequences, such as
introns, control elements such as a promoter, a terminator element, or 5' and/or 3'
untranslated regions effective for expression of the coding sequence in a suitable host,
and/or (iv) in a vector or host environment in which the interferon-alpha homologue coding
sequence is a heterologous nucleic acid sequence or gene. Sequences can also be found in
combination with typical compositional formulations of nucleic acids, including in the
presence of carriers, buffers, adjuvants, excipients and the like.

The term DNA or RNA encoding the respective interferon-alpha homologue polypeptide
includes any oligodeoxynucleotide or oligodeoxyribonucleotide sequence which, upon
expression in an appropriate host cell, results in production of an interferon-alpha
homologue polypeptide of the invention. The DNA or RNA can be produced in an appropriate
host cell, or in a cell-free (in vitro) system, or can be produced synthetically (e.g., by
an amplification technique such as PCR) or chemically.

Using Polynucleotides of the Invention 

The polynucleotides of the invention have a variety of uses in, for example: recombinant
production (i.e., expression) of the interferon-alpha homologue polypeptides of the
invention; as therapeutics and prophylactics, e.g., for use in methods of therapeutic and
prophylactic treatment of a disease, disorder or condition; for use in gene therapy
methods and related applications; as immunogens; for use in diagnostic and screening
assays; as diagnostic probes for the presence of complementary or partially complementary
nucleic acids (including for detection of natural IFN-alpha coding nucleic acids); as
substrates for further reactions, e.g., shuffling reactions or mutation reactions to
produce new and/or improved IFN-alpha homologues, and the like.

EXPRESSION OF POLYPEPTIDES 

In accordance with the present invention, polynucleotide sequences which 

encode novel and/or mature interferon-alpha homologues, fragments of interferon-alpha
proteins, related fusion proteins, or functional equivalents thereof, are collectively referred
to herein as "interferon-alpha homologue polypeptides," "interferon-alpha homologue
proteins," "interferon-alpha homologues," "interferon homologues," "IFN-alpha
homologues," "IFN homologues," "IFN polypeptides," "IFN proteins," "polypeptides of the
invention," or "proteins of the invention." Polypeptide or amino acid fragments of each of
the preceding terms are also intended to be included and encompassed in the polypeptides
or proteins of the invention. Such polynucleotide sequences of the invention are used in
recombinant DNA (or RNA) molecules that direct the expression of the interferon-alpha 
homologue polypeptides in appropriate host cells. Due to the inherent degeneracy of the 
genetic code, other nucleic acid sequences which encode substantially the same or a 
functionally equivalent amino acid sequence are also used to clone and express the 
interferon homologues.

Modified Coding Sequences 

As will be understood by those of skill in the art, it can be advantageous to 
modify a coding sequence (including, e.g., a nucleotide sequence encoding an interferon- 
alpha homologue of the invention or a fragment thereof) to enhance its expression in a 

particular host. The genetic code is redundant with 64 possible codons, but most

organisms preferentially use a subset of these codons. The codons that are utilized most 
often in a species are called optimal codons, and those not utilized very often are classified 
as rare or low-usage codons (see, e.g., Zhang S.P. et al. (1991) Gene 105:61-72). Codons 
can be substituted to reflect the preferred codon usage of the host, a process called "codon 

optimization" or "controlling for species codon bias."

Optimized coding sequence containing codons preferred by a particular 
prokaryotic or eukaryotic host (see also Murray, E. et al. (1989) Nuc. Acids Res. 17:477- 
508) can be prepared, for example, to increase the rate of translation or to produce 
recombinant RNA transcripts having desirable properties, such as a longer half-life, as 

compared with transcripts produced from a non-optimized sequence. Translation stop
codons can also be modified to reflect host preference. For example, preferred stop 
codons for S. cerevisiae and mammals are UAA and UGA, respectively. The preferred 
stop codon for monocotyledonous plants is UGA, whereas insects and E. coli prefer to use 
UAA as the stop codon (Dalphin M.E. et al. (1996) Nuc. Acids Res. 24:216-218). 
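
The codon substitution and stop-codon adjustment described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the stop-codon preferences are those recited in the text (written as DNA), while the preferred-codon table `TOY_PREFERRED`, the function names, and the example sequence are hypothetical placeholders and are not taken from any real codon-usage table or from a sequence of the invention.

```python
from itertools import product

# Standard genetic code built in TCAG order (DNA alphabet, '*' marks a stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Stop-codon preferences as recited in the text (UAA -> TAA, UGA -> TGA).
PREFERRED_STOP = {
    "S. cerevisiae": "TAA",
    "mammal": "TGA",
    "monocot plant": "TGA",
    "insect": "TAA",
    "E. coli": "TAA",
}

# HYPOTHETICAL preferred codons for illustration only; a real table would be
# derived from codon-usage statistics of the chosen host.
TOY_PREFERRED = {"L": "CTG", "R": "CGT", "K": "AAA"}


def translate(dna):
    """Translate a coding DNA sequence codon by codon."""
    dna = dna.upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def optimize(dna, preferred, host):
    """Swap each codon for the host-preferred synonymous codon, if one is given."""
    dna = dna.upper()
    if len(dna) % 3:
        raise ValueError("coding sequence length must be a multiple of 3")
    out = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        new = PREFERRED_STOP[host] if aa == "*" else preferred.get(aa, codon)
        if CODON_TABLE[new] != aa:
            raise ValueError(f"{new} does not encode {aa}")
        out.append(new)
    return "".join(out)


original = "ATGTTAAGGAAGTAG"  # arbitrary toy sequence: Met-Leu-Arg-Lys-stop
optimized = optimize(original, TOY_PREFERRED, "E. coli")
# The encoded amino acid sequence is unchanged by the substitution.
assert translate(optimized) == translate(original)
```

The check inside `optimize` guards against a preference table that would silently change the encoded polypeptide, which is the one property codon optimization must preserve.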

The polynucleotide sequences of the present invention can be engineered in

order to alter an interferon homologue coding sequence for a variety of reasons, including 
but not limited to, alterations which modify the cloning, processing and/or expression of
the gene product. For example, alterations may be introduced using techniques which are
well known in the art, e.g., site-directed mutagenesis, to insert new restriction sites, to alter 
glycosylation patterns, to change codon preference, to introduce splice sites, etc. 

Vectors, Promoters and Expression Systems

The present invention also includes recombinant constructs comprising one
or more of the nucleic acid sequences as broadly described herein (e.g., those encoding an
interferon-alpha homologue of the invention or a fragment thereof). The constructs
comprise a vector, such as a plasmid, a cosmid, a phage, a virus (including a retrovirus), a
bacterial artificial chromosome (BAC), a yeast artificial chromosome (YAC), and the like,
into which a nucleic acid sequence of the invention has been inserted, in a forward or
reverse orientation. In a preferred aspect of this embodiment, the construct further 
comprises regulatory sequences, including, for example, a promoter, operably linked to the 
sequence. Large numbers of suitable vectors and promoters are known to those of skill in 
the art, and are commercially available. 

General texts which describe molecular biological techniques useful herein,
including the use of vectors, promoters and many other relevant topics, include Juo, P-S.,
Concise Dictionary of Biomedical and Molecular Biology (CRC Press 1996);
Singleton et al., Dictionary of Microbiology and Molecular Biology (2d ed. 1994);
The Cambridge Dictionary of Science and Technology (Walker ed., 1988); Hale &
Marham, The Harper Collins Dictionary of Biology (1991); Scott and Mercer,
Concise Encyclopedia of Biochemistry and Molecular Biology (3d ed. 1997);
Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology,
volume 152, Academic Press, Inc., San Diego, CA (hereinafter "Berger"); Sambrook et al.,
Molecular Cloning - A Laboratory Manual (2nd Ed.), Vol. 1-3, Cold Spring Harbor
Laboratory, Cold Spring Harbor, New York, 1989 ("Sambrook"); and Current Protocols in
Molecular Biology, F.M. Ausubel et al., eds., Current Protocols, a joint venture between
Greene Publishing Associates, Inc. and John Wiley & Sons, Inc. (supplemented through
1999) ("Ausubel"). Examples of techniques sufficient to direct persons of skill through
in vitro amplification methods, including the polymerase chain reaction (PCR), the ligase
chain reaction (LCR), Qβ-replicase amplification and other RNA polymerase mediated
techniques (e.g., NASBA), e.g., for the production of the homologous nucleic acids of the
invention, are found in Berger, Sambrook, and Ausubel, as well as Mullis et al. (1987) U.S.
Patent No. 4,683,202; U.S. Pat. No. 4,683,195, issued July 28, 1987; PCR Protocols: A
Guide to Methods and Applications (Innis et al., eds.) Academic Press Inc. San Diego, CA
(1990) (Innis); Arnheim & Levinson (October 1, 1990) C&EN 36-47; The Journal Of
NIH Research (1991) 3, 81-94; Kwoh et al. (1989) Proc. Natl. Acad. Sci. USA 86, 1173;
Guatelli et al. (1990) Proc. Natl. Acad. Sci. USA 87, 1874; Lomell et al. (1989) J. Clin.
Chem. 35, 1826; Landegren et al. (1988) Science 241, 1077-1080; Van Brunt (1990)
Biotechnology 8, 291-294; Wu and Wallace (1989) Gene 4, 560; Barringer et al. (1990)
Gene 89, 117; and Sooknanan and Malek (1995) Biotechnology 13:563-564.

PCR generally refers to a procedure wherein minute amounts of a specific
piece of nucleic acid, RNA, and/or DNA, are amplified by methods well known in the art
(see, e.g., U.S. Pat. No. 4,683,195 and other references above). Generally, sequence
information from the ends of the region of interest or beyond is used to design
oligonucleotide primers. Such primers will be identical or similar in sequence to the
opposite strands of the template to be amplified. The 5' terminal nucleotides of the
opposite strands may coincide with the ends of the amplified material. PCR may be used
to amplify specific RNA or specific DNA sequences, recombinant DNA or RNA
sequences, DNA and RNA sequences from total genomic DNA, and cDNA transcribed
from total cellular RNA, bacteriophage or plasmid sequences, etc. PCR is one example,
but not the only example, of a nucleic acid polymerase reaction method for amplifying a
nucleic acid test sample comprising the use of another (e.g., known) nucleic acid as a
primer. Improved methods of cloning in vitro amplified nucleic acids are described in
Wallace et al., U.S. Pat. No. 5,426,039. Improved methods of amplifying large nucleic
acids by PCR are summarized in Cheng et al. (1994) Nature 369:684-685 and the
references therein, in which PCR amplicons of up to 40 kb are generated. One of skill will
appreciate that essentially any RNA can be converted into a double stranded DNA suitable
for restriction digestion, PCR expansion and sequencing using reverse transcriptase and a 
polymerase. See Ausubel, Sambrook and Berger, all supra. 
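
As a rough illustration of how primers are derived from the ends of a region of interest, the following Python sketch takes the forward primer from the 5' end of the region on the given strand and the reverse primer as the reverse complement of the 3' end, then reconstructs the expected amplicon in silico. The function names, the six-nucleotide primer length, and the template are hypothetical and chosen for readability only; practical primer design also weighs length, melting temperature and specificity, none of which is modeled here.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def design_primers(template, start, end, length=6):
    """Pick primers flanking template[start:end] (0-based, end exclusive)."""
    region = template.upper()[start:end]
    forward = region[:length]                       # identical to the top strand
    reverse = reverse_complement(region[-length:])  # identical to the bottom strand
    return forward, reverse


def amplify(template, forward, reverse):
    """Return the product bounded by the two primers, or None if either fails to anneal."""
    template = template.upper()
    i = template.find(forward)
    j = template.find(reverse_complement(reverse))
    if i < 0 or j < i:
        return None
    return template[i:j + len(reverse)]


# Arbitrary toy template; the region of interest spans positions 4..22.
template = "TTTTATGGCCAAGCTGACCTAACCCC"
fwd, rev = design_primers(template, 4, 22)
product = amplify(template, fwd, rev)
```

Because each primer matches one of the two opposite strands, the 5' ends of the primers define the ends of the amplified material, as the paragraph above notes.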

The present invention also relates to host cells which are transduced with 
vectors of the invention, and the production of polypeptides of the invention (including 

fragments thereof) by recombinant techniques. Host cells are genetically engineered (i.e.,
transduced, transformed or transfected) with the vectors of this invention, which may be,
for example, a cloning vector or an expression vector. The vector may be, for example, in
the form of a plasmid, a viral particle, a phage, etc. The engineered host cells can be
cultured in conventional nutrient media modified as appropriate for activating promoters,
selecting transformants, or amplifying the interferon homologue gene. The culture
conditions, such as temperature, pH and the like, are those previously used with the host
cell selected for expression, and will be apparent to those skilled in the art and in the
references cited herein, including, e.g., Freshney (1994) Culture of Animal Cells, a 
Manual of Basic Technique, 3d ed., Wiley-Liss, New York and the references cited 
therein. 

The interferon homologue polypeptides and proteins of the invention can 

also be produced in non-animal cells such as plants, yeast, fungi, bacteria and the like. In
addition to Sambrook, Berger and Ausubel, details regarding cell culture can be found in 
Payne et al (1992) Plant Cell and Tissue Culture in Liquid Systems, John Wiley & Sons, 
Inc. New York, NY; Gamborg and Phillips (eds.) (1995) Plant Cell, Tissue and Organ 
Culture; Fundamental Methods, Springer Lab Manual, Springer-Verlag (Berlin 

Heidelberg New York) and Atlas and Parks (eds.) The Handbook of Microbiological
Media (1993) CRC Press, Boca Raton, FL. 

The polynucleotides of the present invention may be included in any one of 
a variety of expression vectors for expressing a polypeptide. Such vectors include 
chromosomal, nonchromosomal and synthetic DNA sequences, e.g., derivatives of SV40; 

bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors derived from
combinations of plasmids and phage DNA, viral DNA such as vaccinia, adenovirus, fowl
pox virus, pseudorabies, adeno-associated virus, retroviruses and many others.
Any vector that transduces genetic material into a cell, and, if replication is desired,
which is replicable and viable in the relevant host can be used. 

The nucleic acid sequence in the expression vector is operatively linked to
an appropriate transcription control sequence (promoter) to direct mRNA synthesis.
Examples of such promoters include: LTR or SV40 promoter, E. coli lac or trp promoter,
phage lambda PL promoter, and other promoters known to control expression of genes in
prokaryotic or eukaryotic cells or their viruses. The expression vector also contains a
ribosome binding site for translation initiation, and a transcription terminator. The vector
optionally includes appropriate sequences for amplifying expression. In addition, the
expression vectors optionally comprise one or more selectable marker genes to provide a
phenotypic trait for selection of transformed host cells, such as dihydrofolate reductase or
neomycin resistance for eukaryotic cell culture, or such as tetracycline or ampicillin 
resistance in E. coli. 

The vector containing the appropriate DNA sequence as described herein, 
as well as an appropriate promoter or control sequence, may be employed to transform an
appropriate host to permit the host to express the protein. Examples of appropriate
expression hosts include: bacterial cells, such as E. coli, Streptomyces, and Salmonella
typhimurium; fungal cells, such as Saccharomyces cerevisiae, Pichia pastoris, and
Neurospora crassa; insect cells such as Drosophila and Spodoptera frugiperda;
mammalian cells such as CHO, COS, BHK, HEK 293 or Bowes melanoma; plant cells,
etc. It is understood that not all cells or cell lines need to be capable of producing fully 
functional interferon homologues; for example, antigenic fragments of an interferon 
homologue may be produced in a bacterial or other expression system. The invention is 
not limited by the host cells employed. 

In bacterial systems, a number of expression vectors may be selected
depending upon the use intended for the interferon homologue. For example, when large
quantities of interferon homologue or fragments thereof are needed for the induction of
antibodies, vectors which direct high level expression of fusion proteins that are readily
purified may be desirable. Such vectors include, but are not limited to, multifunctional E.
coli cloning and expression vectors such as BLUESCRIPT (Stratagene), in which the
interferon homologue coding sequence may be ligated into the vector in-frame with
sequences for the amino-terminal Met and the subsequent 7 residues of beta-galactosidase
so that a hybrid protein is produced; pIN vectors (Van Heeke & Schuster (1989) J. Biol.
Chem. 264:5503-5509); pET vectors (Novagen, Madison WI); and the like. 

Similarly, in the yeast Saccharomyces cerevisiae a number of vectors
containing constitutive or inducible promoters such as alpha factor, alcohol oxidase and
PGH may be used for production of the interferon homologue proteins of the invention.
For reviews, see Ausubel et al. (supra) and Grant et al. (1987) Methods in Enzymology
153:516-544.

In mammalian host cells, a number of expression systems, such as viral-based
systems, may be utilized. In cases where an adenovirus is used as an expression vector, a
coding sequence is optionally ligated into an adenovirus transcription/translation complex
consisting of the late promoter and tripartite leader sequence. Insertion in a nonessential
E1 or E3 region of the viral genome will result in a viable virus capable of expressing
interferon homologue in infected host cells (Logan and Shenk (1984) Proc. Natl. Acad.
Sci. 81:3655-3659). In addition, transcription enhancers, such as the Rous sarcoma virus
(RSV) enhancer, may be used to increase expression in mammalian host cells.

Additional Expression Elements 

Specific initiation signals can aid in efficient translation of an interferon 
homologue coding sequence. These signals can include, e.g., the ATG initiation codon 
and adjacent sequences. In cases where an interferon homologue coding sequence, its
initiation codon and upstream sequences are inserted into the appropriate expression
vector, no additional translational control signals may be needed. However, in cases
where only coding sequence (e.g., a mature protein coding sequence), or a portion thereof,
is inserted, exogenous translational control signals including the ATG initiation codon
must be provided. Furthermore, the initiation codon must be in the correct reading frame
to ensure translation of the entire insert. Exogenous translational elements and
initiation codons can be of various origins, both natural and synthetic. The efficiency of
expression may be enhanced by the inclusion of enhancers appropriate to the cell system
in use (Scharf, D. et al. (1994) Results Probl. Cell Differ. 20:125-62; Bittner et al. (1987)
Methods in Enzymol. 153:516-544). 
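
The requirement that the initiation codon be present and in the correct reading frame can be checked on a candidate insert before cloning. The Python sketch below is one simple way to do so, assuming the insert is supplied as the coding strand from the intended ATG through the stop codon; the function name `check_orf` and the example sequences are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_orf(insert):
    """Return a list of reading-frame problems found in a coding-strand insert."""
    seq = insert.upper()
    problems = []
    if not seq.startswith("ATG"):
        problems.append("no ATG initiation codon at position 1")
    if len(seq) % 3:
        problems.append("length is not a multiple of 3")
    # Split into complete codons, ignoring any trailing partial codon.
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    for number, codon in enumerate(codons[:-1], start=1):
        if codon in STOP_CODONS:
            problems.append(f"premature stop codon at codon {number}")
            break
    if not codons or codons[-1] not in STOP_CODONS:
        problems.append("no terminal stop codon")
    return problems


in_frame = check_orf("ATGGCCAAGTAA")    # arbitrary toy insert, reads through cleanly
truncated = check_orf("ATGTAAGCCTGA")   # stop codon immediately after the ATG
shifted = check_orf("TGGCCAAGTAA")      # same toy insert with the first base lost
```

An empty list indicates that the insert reads from the ATG to a terminal stop without interruption; a one-base shift, as in the last example, typically triggers several of the checks at once.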

Secretion/Localization Sequences

Polynucleotides of the invention can also be fused, for example, in-frame to 

nucleic acid encoding a secretion/localization sequence, to target polypeptide expression 

to a desired cellular compartment, membrane, or organelle, or to direct polypeptide 

secretion to the periplasmic space or into the cell culture media. Such sequences are 

known to those of skill, and include secretion leader peptides, organelle targeting

sequences (e.g., nuclear localization sequences, ER retention signals, mitochondrial transit 
sequences, chloroplast transit sequences), membrane localization/anchor sequences (e.g., 
stop transfer sequences, GPI anchor sequences), and the like. Polypeptides expressed by 
such polynucleotides of the invention may include the amino acid sequence corresponding 

to the secretion and/or localization sequence(s).

Expression Hosts

In a further embodiment, the present invention relates to host cells 
containing the above-described constructs. The host cell can be a eukaryotic cell, such as 
a mammalian cell, a yeast cell, or a plant cell, or the host cell can be a prokaryotic cell, 
such as a bacterial cell. Introduction of the construct into the host cell can be effected by
calcium phosphate transfection, DEAE-Dextran mediated transfection, electroporation, or
other common techniques (Davis, L., Dibner, M., and Battey, I. (1986) Basic Methods in
Molecular Biology). The cell may include a nucleic acid of the invention, said nucleic
acid encoding a polypeptide, wherein said cell expresses a polypeptide (e.g., an
interferon-alpha homologue polypeptide having an antiviral or anti-proliferative activity as
measured by the assays described herein). The invention also includes a vector
comprising any nucleic acid of the invention described herein and includes a cell
transduced by such a vector. Furthermore, cells and transgenic animals which include any
polypeptide or nucleic acid above or throughout this specification, e.g., produced by
transduction of a vector of the invention, are an additional feature of the invention.

A host cell strain is optionally chosen for its ability to modulate the 
expression of the inserted sequences or to process the expressed protein in the desired 
fashion. Such modifications of the protein include, but are not limited to, acetylation, 
carboxylation, glycosylation, phosphorylation, lipidation and acylation. Post-translational 

processing which cleaves a "pre" or a "prepro" form of the protein may also be important
for correct insertion, folding and/or function. Different host cells such as CHO, HeLa, 
BHK, MDCK, 293, WI38, etc. have specific cellular machinery and characteristic 
mechanisms for such post-translational activities and may be chosen to ensure the correct 
modification and processing of the introduced, foreign protein. 

For long-term, high-yield production of recombinant proteins, stable

expression can be used. For example, cell lines which stably express a polypeptide of the 
invention are transduced using expression vectors which contain viral origins of 
replication or endogenous expression elements and a selectable marker gene. Following 
the introduction of the vector, cells may be allowed to grow for 1-2 days in an enriched 

media before they are switched to selective media. The purpose of the selectable marker
is to confer resistance to selection, and its presence allows growth and recovery of cells
which successfully express the introduced sequences. For example, resistant clumps of
stably transformed cells can be proliferated using tissue culture techniques appropriate to
the cell type. 

Host cells transformed with a nucleotide sequence encoding a polypeptide 
of the invention are optionally cultured under conditions suitable for the expression and 
recovery of the encoded protein from cell culture. The protein or fragment thereof
produced by a recombinant cell may be secreted, membrane-bound, or contained
intracellularly, depending on the sequence and/or the vector used. As will be understood
by those of skill in the art, expression vectors containing polynucleotides encoding mature
interferon homologues of the invention can be designed with signal sequences which
direct secretion of the mature polypeptides through a prokaryotic or eukaryotic cell
membrane. 

Additional Polypeptide Sequences 

The polynucleotides of the present invention may also comprise a coding 
sequence fused in-frame to a marker sequence which, e.g., facilitates purification of the 

encoded polypeptide of the invention. Such purification facilitating domains include, but
are not limited to, metal chelating peptides such as histidine-tryptophan modules that
allow purification on immobilized metals, a sequence which binds glutathione (e.g., GST),
a hemagglutinin (HA) tag (corresponding to an epitope derived from the influenza
hemagglutinin protein; Wilson, I. et al. (1984) Cell 37:767), maltose binding protein
sequences, the FLAG epitope utilized in the FLAGS extension/affinity purification system
(Immunex Corp., Seattle, WA), and the like. The inclusion of a protease-cleavable
polypeptide linker sequence between the purification domain and the interferon
homologue sequence is useful to facilitate purification. One expression vector
contemplated for use in the compositions and methods described herein provides for
expression of a fusion protein comprising a polypeptide of the invention fused to a
polyhistidine region separated by an enterokinase cleavage site. The histidine residues
facilitate purification on IMIAC (immobilized metal ion affinity chromatography, as
described in Porath et al. (1992) Protein Expression and Purification 3:263-281), while
the enterokinase cleavage site provides a means for separating the interferon homologue
polypeptide from the fusion protein. pGEX vectors (Promega; Madison, WI) may also be
used to express foreign polypeptides as fusion proteins with glutathione S-transferase
(GST). In general, such fusion proteins are soluble and can easily be purified from lysed
cells by adsorption to ligand-agarose beads (e.g., glutathione-agarose in the case of GST-
fusions) followed by elution in the presence of free ligand. 
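
The polyhistidine/enterokinase arrangement described above can be pictured at the amino acid level with a short Python sketch. It assumes an N-terminal tag of six histidines and the enterokinase recognition sequence Asp-Asp-Asp-Asp-Lys, with cleavage after the lysine; the tag length, the function names, and the placeholder peptide are illustrative assumptions and do not represent a polypeptide of the invention.

```python
HIS_TAG = "HHHHHH"   # assumption: a six-residue polyhistidine region
EK_SITE = "DDDDK"    # enterokinase recognition sequence; cleavage follows the K


def build_fusion(mature):
    """Assemble Met + polyhistidine + enterokinase site + mature polypeptide."""
    return "M" + HIS_TAG + EK_SITE + mature.upper()


def cleave(fusion):
    """Split a fusion protein after the first enterokinase site."""
    index = fusion.find(EK_SITE)
    if index < 0:
        raise ValueError("no enterokinase site found")
    cut = index + len(EK_SITE)
    return fusion[:cut], fusion[cut:]


placeholder = "ACDEFGHIK"            # arbitrary peptide used only for illustration
fusion = build_fusion(placeholder)
tag_part, released = cleave(fusion)
```

Placing the tag and cleavage site on the amino-terminal side means that cleavage leaves no tag-derived residues on the released polypeptide, which is one reason this orientation is often chosen.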

Polypeptide Production and Recovery 

Following transduction of a suitable host strain and growth of the host 
strain to an appropriate cell density, the selected promoter is induced by appropriate means
(e.g., temperature shift or chemical induction) and cells are cultured for an additional 
period. Cells are typically harvested by centrifugation, disrupted by physical or chemical 
means, and the resulting crude extract retained for further purification. Microbial cells 
employed in expression of proteins can be disrupted by any convenient method, including 

freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents, or
other methods, which are well known to those skilled in the art.

As noted, many references are available for the culture and production of 
many cells, including cells of bacterial, plant, animal (especially mammalian) and 
archaebacterial origin. See, e.g., Sambrook, Ausubel, and Berger (all supra), as well as
Freshney (1994) Culture of Animal Cells, a Manual of Basic Technique, third edition,
Wiley-Liss, New York and the references cited therein; Doyle and Griffiths (1997)
Mammalian Cell Culture: Essential Techniques, John Wiley and Sons, NY; Humason
(1979) Animal Tissue Techniques, 4th edition, W.H. Freeman and Company; and
Ricciardelli et al. (1989) In vitro Cell Dev. Biol. 25:1016-1024. For plant cell culture and
regeneration, see Payne et al. (1992) Plant Cell and Tissue Culture in Liquid Systems, John
Wiley & Sons, Inc., New York, NY; Gamborg and Phillips (eds.) (1995) Plant Cell,
Tissue and Organ Culture; Fundamental Methods, Springer Lab Manual, Springer-Verlag
(Berlin Heidelberg New York); and Plant Molecular Biology (1993) R.R.D. Croy, ed.,
Bios Scientific Publishers, Oxford, U.K. ISBN 0 12 198370 6. Cell culture media in
general are set forth in Atlas and Parks (eds.) The Handbook of Microbiological Media
(1993) CRC Press, Boca Raton, FL. Additional information for cell culture is found in
available commercial literature such as the Life Science Research Cell Culture Catalogue
(1998) from Sigma-Aldrich, Inc. (St. Louis, MO) ("Sigma-LSRCCC") and, e.g., the Plant
Culture Catalogue and supplement (1997) also from Sigma-Aldrich, Inc. (St. Louis, MO)
("Sigma-PCCS").

Polypeptides of the invention can be recovered and purified from 
recombinant cell cultures by any of a number of methods well known in the art, including
ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange
chromatography, phosphocellulose chromatography, hydrophobic interaction
chromatography, affinity chromatography (e.g., using any of the tagging systems noted
herein), hydroxylapatite chromatography, and lectin chromatography. Protein refolding
steps can be used, as desired, in completing configuration of the mature protein. Finally,
high performance liquid chromatography (HPLC) can be employed in the final
purification steps. In addition to the references noted supra, a variety of purification
methods are well known in the art, including, e.g., those set forth in Sandana (1997)
Bioseparation of Proteins, Academic Press, Inc.; Bollag et al. (1996) Protein
Methods, 2nd Edition, Wiley-Liss, NY; Walker (1996) The Protein Protocols Handbook,
Humana Press, NJ; Harris and Angal (1990) Protein Purification Applications: A
Practical Approach, IRL Press at Oxford, Oxford, England; Harris and Angal, Protein
Purification Methods: A Practical Approach, IRL Press at Oxford, Oxford, England;
Scopes (1993) Protein Purification: Principles and Practice, 3rd Edition, Springer Verlag,
NY; Janson and Ryden (1998) Protein Purification: Principles, High Resolution Methods
and Applications, Second Edition, Wiley-VCH, NY; and Walker (1998) Protein Protocols
on CD-ROM, Humana Press, NJ. 

In vitro Expression Systems 

Cell-free transcription/translation systems can also be employed to produce 
polypeptides using DNAs or RNAs of the present invention. Several such systems are

commercially available. A general guide to in vitro transcription and translation protocols 
is found in Tymms (1995) In vitro Transcription and Translation Protocols: Methods in 
Molecular Biology, Volume 37, Garland Publishing, NY. 

Modified Amino Acids

Polypeptides of the invention may contain one or more modified amino

acids. The presence of modified amino acids may be advantageous in, for example, (a) 
increasing polypeptide serum half life, (b) reducing polypeptide antigenicity, or (c)
increasing polypeptide storage stability. Amino acid(s) are modified, for example,
co-translationally or post-translationally during recombinant production (e.g., N-linked
glycosylation at N-X-S/T motifs during expression in mammalian cells) or modified by
synthetic means.

Non-limiting examples of a modified amino acid include a glycosylated
amino acid, a sulfated amino acid, a prenylated (e.g., farnesylated, geranylgeranylated) 
amino acid, an acetylated amino acid, an acylated amino acid, a PEG-ylated amino acid, a 
biotinylated amino acid, a carboxylated amino acid, a phosphorylated amino acid, and the 
like. References adequate to guide one of skill in the modification of amino acids are
replete throughout the literature. Example protocols are found in Walker (1998) Protein
Protocols on CD-ROM, Humana Press, Totowa, NJ.
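
Candidate sites for the N-linked glycosylation mentioned above can be located by scanning a polypeptide sequence for the N-X-S/T motif. The Python sketch below does this with a regular expression. It adds one assumption that the text does not state, namely the commonly applied rule that X may be any residue except proline; the function name and the example peptide are hypothetical.

```python
import re

# N followed by any residue except P, then S or T. The lookahead allows
# overlapping motifs (e.g., "NNTS") to be reported individually.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_sequons(protein):
    """Return 1-based positions of asparagines in N-X-S/T motifs (X not Pro)."""
    return [match.start() + 1 for match in SEQUON.finditer(protein.upper())]


# Arbitrary toy peptide: N2 (N-A-S) and N10 (N-G-T) match; N6 (N-P-T) does not.
sites = find_sequons("MNASQNPTKNGT")
```

A matching motif marks only a potential site; whether it is actually glycosylated depends on the expression host and the local protein context.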

The polynucleotides and polypeptides of the invention have a variety of 
uses, including, but not limited to, for example: in recombinant production (i.e., 

expression) of the recombinant interferon-alpha homologues of the invention; as
therapeutic and prophylactic agents in methods of in vivo and ex vivo treatment of a
variety of diseases, disorders, and conditions in a variety of subjects; for use in in vitro
methods, such as diagnostic and screening methods, to detect, diagnose, and treat a variety
of diseases, disorders, and conditions (e.g., cancers, viral-based disorders,
angiogenic-based disorders) in a variety of subjects (e.g., mammals); as immunogens; in gene therapy
methods and DNA- or RNA-based delivery methods to deliver or administer in vivo, ex
vivo, or in vitro biologically active polypeptides of the invention to a tissue, population of
cells, organ, graft, bodily system of a subject (e.g., organ system, lymphatic system, blood
system, etc.); as DNA vaccines, multi-component vaccines for use in prophylactic or
therapeutic treatment of a variety of diseases, disorders, or other conditions (e.g., cancers,
viral-based disorders, angiogenic-based disorders) in a variety of subjects (e.g.,
mammals); as adjuvants to enhance or augment an immune response in a subject; as a
component of a multiple-step boosting vaccination method (e.g., a format comprising a
prime vaccination by delivery of a DNA or RNA nucleotide (e.g., a nucleotide encoding a
polypeptide of the invention or encoding another polypeptide) followed by a second boost
of a polypeptide (e.g., a polypeptide of the invention or other polypeptide)); as diagnostic
probes for the presence of complementary or partially complementary nucleic acids
(including for detection of natural interferon-alpha coding nucleic acids); as substrates for
further reactions, e.g., shuffling reactions, mutation reactions, or other diversity generation
reactions to produce new and/or improved interferon-alpha homologues and new
interferon-alpha nucleic acids encoding such homologues, e.g., to evolve novel therapeutic
or prophylactic properties, and the like; for polymerase chain reactions (PCR) or cloning
methods, e.g., including digestion or ligation reactions, to identify new and/or improved
naturally-occurring or non-naturally occurring IFN-alpha nucleic acids and polypeptides 
encoded therefrom. Polynucleotides which encode an interferon homologue of the 
invention, or complements of the polynucleotides, are optionally administered to a cell to 
accomplish a therapeutically or prophylactically useful process or to express a

therapeutically useful product in vivo, ex vivo, or in vitro. These applications, including in 
vivo or ex vivo applications, including, e.g., gene therapy, include a multitude of 
techniques by which gene expression may be altered in cells. Such methods include, for 
instance, the introduction of genes for expression of, e.g., therapeutically or 

prophylactically useful polypeptides, such as the interferon homologues of the present
invention. Such methods include, for example, infecting with a retrovirus comprising the 
polynucleotides and/or polypeptides of the invention. Optionally, the retrovirus further 
comprises additional exogenous, e.g., therapeutic or prophylactic gene construct, 
sequences. In one aspect, the invention provides gene therapy methods of prophylactically 

or therapeutically treating a disease, disorder or condition in a subject in need of such
treatment by administering in vivo, ex vivo, or in vitro one or more nucleic acids of the 
invention described herein to one or more cells of a subject, including an organism or 
mammal, including, e.g., a human, primate, mouse, pig, cow, goat, rabbit, rat, guinea pig, 
hamster, horse, sheep; or a non-mammalian vertebrate such as a bird (e.g., a chicken or 

20 duck) or a fish, or invertebrate, as described in more detail below. 

In another aspect, the invention provides methods of prophylactically or
therapeutically treating a disease, disorder or condition in a subject in need of such
treatment by administering in vivo, ex vivo, or in vitro one or more polypeptides of the
invention described herein to one or more cells of a subject (including those defined
herein), as described in more detail below.

Polypeptide Expression 

Polynucleotides encoding interferon homologue polypeptides of the
invention are particularly useful for in vivo or ex vivo therapeutic or prophylactic
applications, using techniques well known to those skilled in the art. For example,
cultured cells are engineered ex vivo with a polynucleotide (DNA or RNA), with the
engineered cells then being returned to the patient. Cells may also be engineered in vivo
or ex vivo for expression of a polypeptide in vivo or ex vivo, respectively.

A number of viral vectors suitable for organismal in vivo or ex vivo
transduction and expression are known. Such vectors include retroviral vectors (see
Miller (1992) Curr. Top. Microbiol. Immunol. 158:1-24; Salmons and Gunzburg (1993)
Human Gene Therapy 4:129-141; Miller et al. (1994) Methods in Enzymology 211:581-599)
and adeno-associated vectors (reviewed in Carter (1992) Curr. Opinion Biotech.
3:533-539; Muzyczka (1992) Curr. Top. Microbiol. Immunol. 158:97-129). Other viral
vectors that are used include adenoviral vectors, herpes viral vectors and Sindbis viral
vectors, as generally described in, e.g., Jolly (1994) Cancer Gene Therapy 1:51-64;
Latchman (1994) Molec. Biotechnol. 2:179-195; and Johanning et al. (1995) Nucl. Acids
Res. 23:1495-1501.

Gene therapy provides methods for combating chronic infectious diseases
(e.g., HIV infection, viral hepatitis, Herpes Simplex Virus (HSV), hepatitis B (HepB),
dengue virus, etc.), as well as non-infectious diseases including cancer and allergic
diseases and some forms of congenital defects such as enzyme deficiencies. Several
approaches for introducing nucleic acids into cells in vivo, ex vivo and in vitro have been
used. These include liposome based gene delivery (Debs and Zhu (1993) WO 93/24640
and U.S. Pat. No. 5,641,662; Mannino and Gould-Fogerite (1988) BioTechniques
6(7):682-691; Rose, U.S. Pat. No. 5,279,833; Brigham (1991) WO 91/06309; Felgner
et al. (1987) Proc. Nat'l Acad. Sci. USA 84:7413-7414; Brigham et al. (1989) Am. J.
Med. Sci. 298:278-281; Nabel et al. (1990) Science 249:1285-1288; Hazinski et al. (1991)
Am. J. Resp. Cell Molec. Biol. 4:206-209; and Wang and Huang (1987) Proc. Nat'l Acad.
Sci. USA 84:7851-7855); adenoviral vector mediated gene delivery, e.g., to treat cancer
(see, e.g., Chen et al. (1994) Proc. Nat'l Acad. Sci. USA 91:3054-3057; Tong et al. (1996)
Gynecol. Oncol. 61:175-179; Clayman et al. (1995) Cancer Res. 5:1-6; O'Malley et al.
(1995) Cancer Res. 55:1080-1085; Hwang et al. (1995) Am. J. Respir. Cell Mol. Biol.
13:7-16; Haddada et al. (1995) Curr. Top. Microbiol. Immunol. 199 (Pt. 3):297-306;
Addison et al. (1995) Proc. Nat'l Acad. Sci. USA 92:8522-8526; Colak et al. (1995) Brain
Res. 691:76-82; Crystal (1995) Science 270:404-410; Elshami et al. (1996) Human Gene
Ther. 7:141-148; Vincent et al. (1996) J. Neurosurg. 85:648-654), and many other
diseases. Replication-defective retroviral vectors harboring therapeutic polynucleotide
sequence as part of the retroviral genome have also been used, particularly with regard to
simple MuLV vectors. See, e.g., Miller et al. (1990) Mol. Cell. Biol. 10:4239;
Kolberg (1992) J. NIH Res. 4:43; and Cornetta et al. (1991) Hum. Gene Ther. 2:215.
Nucleic acid transport coupled to ligand-specific, cation-based transport systems (Wu and
Wu (1988) J. Biol. Chem. 263:14621-14624) has also been used. Naked DNA
expression vectors have also been described (Nabel et al. (1990), supra; Wolff et al.
(1990) Science 247:1465-1468). In general, these approaches can be adapted to the
invention by incorporating nucleic acids encoding the interferon homologues herein into
the appropriate vectors.

General texts which describe gene therapy protocols, which can be adapted
to the present invention by introducing the nucleic acids of the invention into patients,
include Robbins (1996) Gene Therapy Protocols, Humana Press, NJ, and Joyner (1993)
Gene Targeting: A Practical Approach, IRL Press, Oxford, England.

Antisense Technology

In addition to expression of the nucleic acids of the invention as gene
replacement nucleic acids, the nucleic acids are also useful for sense and anti-sense
suppression of expression, e.g., to down-regulate expression of a nucleic acid of the
invention, once expression of the nucleic acid is no longer desired in the cell. Similarly,
the nucleic acids of the invention, or subsequences or anti-sense sequences thereof, can
also be used to block expression of naturally occurring homologous nucleic acids. A
variety of sense and anti-sense technologies are known in the art, e.g., as set forth in
Lichtenstein and Nellen (1997) Antisense Technology: A Practical Approach, IRL Press at
Oxford University, Oxford, England, and in Agrawal (1996) Antisense Therapeutics,
Humana Press, NJ, and the references cited therein.

Pharmaceutical Compositions 

The polynucleotides and polypeptides of the invention (including vectors,
cells, antibodies, etc., comprising polynucleotides or polypeptides of the invention) may
be employed for therapeutic and prophylactic uses in combination with a suitable
pharmaceutical carrier. Such compositions comprise a therapeutically or prophylactically
effective amount of the polynucleotide or polypeptide of the invention, and a
pharmaceutically acceptable carrier or excipient. A pharmaceutically acceptable carrier
encompasses any of the standard pharmaceutical carriers, buffers and excipients. Such a
carrier or excipient includes, but is not limited to, saline, buffered saline (e.g.,
phosphate-buffered saline solution), dextrose, water, glycerol, ethanol, emulsions (such as
an oil/water or water/oil emulsion), various types of wetting agents and/or adjuvants, and
combinations thereof. Suitable pharmaceutical carriers and agents are described in
Remington's Pharmaceutical Sciences (Mack Publishing Co., Easton, 19th ed. 1995).
The formulation should suit the mode of administration of the active agent (e.g.,
nucleotide, polypeptide, vector, cell, etc.). Methods of administering nucleic acids,
polypeptides, vectors, cells, antibodies, and proteins are well known in the art, and are
further discussed below.

Use as Probes 

Also contemplated are uses of polynucleotides, also referred to herein as
oligonucleotides, typically having at least 12 bases, preferably at least 15, more preferably
at least 20, 30, or 50 bases, which hybridize under at least highly stringent (or ultra-high
stringency or ultra-ultra-high stringency) conditions to an interferon homologue
polynucleotide sequence described above. The polynucleotides may be used as probes,
primers, sense and antisense agents, and the like, according to methods as noted supra.

SEQUENCE VARIATIONS 

Silent Variations 

It will be appreciated by those skilled in the art that due to the degeneracy
of the genetic code, a multitude of nucleic acid sequences encoding interferon homologue
polypeptides of the invention may be produced, some of which may bear minimal sequence
homology to the nucleic acid sequences explicitly disclosed herein.

Table 1
Codon Table

Amino acids                      Codon

Alanine          Ala    A        GCA  GCC  GCG  GCU
Cysteine         Cys    C        UGC  UGU
Aspartic acid    Asp    D        GAC  GAU
Glutamic acid    Glu    E        GAA  GAG
Phenylalanine    Phe    F        UUC  UUU
Glycine          Gly    G        GGA  GGC  GGG  GGU
Histidine        His    H        CAC  CAU
Isoleucine       Ile    I        AUA  AUC  AUU
Lysine           Lys    K        AAA  AAG
Leucine          Leu    L        UUA  UUG  CUA  CUC  CUG  CUU
Methionine       Met    M        AUG
Asparagine       Asn    N        AAC  AAU
Proline          Pro    P        CCA  CCC  CCG  CCU
Glutamine        Gln    Q        CAA  CAG
Arginine         Arg    R        AGA  AGG  CGA  CGC  CGG  CGU
Serine           Ser    S        AGC  AGU  UCA  UCC  UCG  UCU
Threonine        Thr    T        ACA  ACC  ACG  ACU
Valine           Val    V        GUA  GUC  GUG  GUU
Tryptophan       Trp    W        UGG
Tyrosine         Tyr    Y        UAC  UAU

For instance, inspection of the codon table (Table 1) shows that codons
AGA, AGG, CGA, CGC, CGG, and CGU all encode the amino acid arginine. Thus, at
every position in the nucleic acids of the invention where an arginine is specified by a
codon, the codon can be altered to any of the corresponding codons described above
without altering the encoded polypeptide. It is understood that U in an RNA sequence
corresponds to T in a DNA sequence.

Using, as an example, the nucleic acid sequence corresponding to
nucleotides 1-15 of SEQ ID NO:1, TGT GAT CTG CCT CAG, a silent variation of this
sequence includes TGC GAC TTA CCA CAA, both of which encode the amino
acid sequence CDLPQ, corresponding to amino acids 1-5 of SEQ ID NO:36.
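
The equivalence of the two example sequences can be checked mechanically. The following is a minimal sketch, assuming the standard genetic code of Table 1 written with DNA letters (T in place of U); the names CODONS, CODON_TO_AA and translate are illustrative only and are not part of the disclosure.

```python
# Standard genetic code grouped by amino acid as in Table 1,
# using DNA letters (T instead of U). Stop codons are omitted.
CODONS = {
    "A": ["GCA", "GCC", "GCG", "GCT"],
    "C": ["TGC", "TGT"],
    "D": ["GAC", "GAT"],
    "E": ["GAA", "GAG"],
    "F": ["TTC", "TTT"],
    "G": ["GGA", "GGC", "GGG", "GGT"],
    "H": ["CAC", "CAT"],
    "I": ["ATA", "ATC", "ATT"],
    "K": ["AAA", "AAG"],
    "L": ["TTA", "TTG", "CTA", "CTC", "CTG", "CTT"],
    "M": ["ATG"],
    "N": ["AAC", "AAT"],
    "P": ["CCA", "CCC", "CCG", "CCT"],
    "Q": ["CAA", "CAG"],
    "R": ["AGA", "AGG", "CGA", "CGC", "CGG", "CGT"],
    "S": ["AGC", "AGT", "TCA", "TCC", "TCG", "TCT"],
    "T": ["ACA", "ACC", "ACG", "ACT"],
    "V": ["GTA", "GTC", "GTG", "GTT"],
    "W": ["TGG"],
    "Y": ["TAC", "TAT"],
}
# Reverse lookup: codon -> one-letter amino acid code.
CODON_TO_AA = {codon: aa for aa, codons in CODONS.items() for codon in codons}


def translate(dna):
    """Translate a DNA (or RNA) coding sequence into one-letter amino acid codes."""
    dna = dna.replace(" ", "").upper().replace("U", "T")
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TO_AA[dna[i:i + 3]] for i in range(0, len(dna), 3))


original = "TGT GAT CTG CCT CAG"  # nucleotides 1-15 as given in the text
variant = "TGC GAC TTA CCA CAA"   # the silent variation given in the text
print(translate(original), translate(variant))  # CDLPQ CDLPQ
```

Both sequences translate to the same five residues, which is what makes the second a silent variation of the first.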

Such "silent variations" are one species of "conservatively modified
variations," discussed below. One of skill will recognize that each codon in a nucleic acid
(except AUG, which is ordinarily the only codon for methionine) can be modified by
standard techniques to encode a functionally identical polypeptide. Accordingly, each
silent variation of a nucleic acid which encodes a polypeptide is implicit in any described
sequence. The invention provides each and every possible variation of nucleic acid
sequence encoding a polypeptide of the invention that could be made by selecting
combinations based on possible codon choices. These combinations are made in
accordance with the standard triplet genetic code (e.g., as set forth in Table 1) as applied to
the nucleic acid sequence encoding an interferon homologue polypeptide of the invention.
All such variations of every nucleic acid herein are specifically provided and described by
consideration of the sequence in combination with the genetic code.
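
To give a sense of scale for "each and every possible variation," the number of distinct coding sequences for a peptide is the product of the codon counts of its residues in Table 1. The sketch below is illustrative only; DEGENERACY and count_encodings are hypothetical names, and the counts are simply the number of codons listed per amino acid in Table 1.

```python
from math import prod

# Number of codons listed in Table 1 for each amino acid (one-letter code).
DEGENERACY = {
    "A": 4, "C": 2, "D": 2, "E": 2, "F": 2, "G": 4, "H": 2, "I": 3,
    "K": 2, "L": 6, "M": 1, "N": 2, "P": 4, "Q": 2, "R": 6, "S": 6,
    "T": 4, "V": 4, "W": 1, "Y": 2,
}


def count_encodings(peptide):
    """Count the distinct nucleic acid sequences that encode the given peptide."""
    return prod(DEGENERACY[aa] for aa in peptide.upper())


# The five-residue example CDLPQ: 2 * 2 * 6 * 4 * 2 coding sequences,
# one of which is the original sequence itself.
print(count_encodings("CDLPQ"))  # 192
```

Even a five-residue stretch therefore has 191 silent variations besides the sequence as listed, and the number grows multiplicatively with length.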

Conservative Variations 

"Conservatively modified variations" or, simply, "conservative variations"
of a particular nucleic acid sequence refer to those nucleic acids which encode identical
or essentially identical amino acid sequences, or, where the nucleic acid does not encode
an amino acid sequence, to essentially identical sequences. One of skill will recognize
that individual substitutions, deletions or additions which alter, add or delete a single
amino acid or a small percentage of amino acids (typically less than 5%, more typically
less than 4%, 3%, 2% or 1%) in an encoded sequence are "conservatively modified
variations" where the alterations result in the deletion of an amino acid, addition of an
amino acid, or substitution of an amino acid with a chemically similar amino acid.

Conservative substitution tables providing functionally similar amino acids
are well known in the art. Table 2 sets forth six groups which contain amino acids that are
"conservative substitutions" for one another.

Table 2
Conservative Substitution Groups

1    Alanine (A)          Serine (S)           Threonine (T)
2    Aspartic acid (D)    Glutamic acid (E)
3    Asparagine (N)       Glutamine (Q)
4    Arginine (R)         Lysine (K)
5    Isoleucine (I)       Leucine (L)          Methionine (M)     Valine (V)
6    Phenylalanine (F)    Tyrosine (Y)         Tryptophan (W)

Thus, "conservatively substituted variations" or "conservative
substitutions" of a listed polypeptide sequence of the present invention include
substitutions of a small percentage, typically less than 5%, more typically less than 4%,
3%, 2% or 1%, of the amino acids of the polypeptide sequence, with a conservatively
selected amino acid of the same conservative substitution group.

For example, a conservatively substituted variation of the polypeptide
identified herein as SEQ ID NO:36 will contain "conservative substitutions", according to
the six groups defined above, in up to about 8 or 9 residues (i.e., about 5% of the amino
acids) in the 166-amino acid polypeptide.

In a further example, if four conservative substitutions were localized in
the region corresponding to amino acid residues 141-166 of SEQ ID NO:36, examples of
conservatively substituted variations of this region,

WEVVR AEIMR SFSFS TNLQK RLRRKE include:
WEVVR SEIMR SFSYS TNLQR RLRRKD and
WELVR AEIVR SFSFS TNLNK RLRKKE and the like, in accordance with the
conservative substitutions listed in Table 2 (in the above example, conservative
substitutions are underlined). Listing of a protein sequence herein, in conjunction with the
above substitution table, provides an express listing of all conservatively substituted
proteins.
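
Whether a given variant differs from a listed sequence only by Table 2 substitutions can be checked position by position. The sketch below is illustrative; GROUPS, GROUP_OF and compare are hypothetical names, and the two variant strings are the example sequences as best they can be read from the text, which is an assumption where the printed residues are unclear.

```python
# Table 2 groups, one string of one-letter codes per group.
GROUPS = ["AST", "DE", "NQ", "RK", "ILMV", "FYW"]
GROUP_OF = {aa: index for index, group in enumerate(GROUPS, start=1) for aa in group}


def compare(reference, variant, first_residue=141):
    """Return (conservative, non_conservative) lists of (position, ref_aa, var_aa)."""
    ref = reference.replace(" ", "")
    var = variant.replace(" ", "")
    if len(ref) != len(var):
        raise ValueError("sequences must have the same length")
    conservative, non_conservative = [], []
    for offset, (a, b) in enumerate(zip(ref, var)):
        if a == b:
            continue
        change = (first_residue + offset, a, b)
        # Conservative only if both residues belong to the same Table 2 group.
        if a in GROUP_OF and GROUP_OF[a] == GROUP_OF.get(b):
            conservative.append(change)
        else:
            non_conservative.append(change)
    return conservative, non_conservative


REFERENCE = "WEVVR AEIMR SFSFS TNLQK RLRRKE"  # residues 141-166 as listed
VARIANT_1 = "WEVVR SEIMR SFSYS TNLQR RLRRKD"  # assumed reading of the first variant
VARIANT_2 = "WELVR AEIVR SFSFS TNLNK RLRKKE"

for variant in (VARIANT_1, VARIANT_2):
    conservative, other = compare(REFERENCE, variant)
    print(len(conservative), "conservative,", len(other), "other:", conservative)
```

Under these readings each variant carries exactly four substitutions, all within a single Table 2 group, consistent with the four conservative substitutions the example describes.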

Finally, the addition of sequences which do not alter the encoded activity of
a nucleic acid molecule, such as the addition of a non-functional sequence, is a
conservative variation of the basic nucleic acid.

One of ordinary skill will appreciate that many conservative variations of
the nucleic acid constructs which are disclosed yield a functionally identical construct.
For example, as discussed above, owing to the degeneracy of the genetic code, "silent
substitutions" (i.e., substitutions in a nucleic acid sequence which do not result in an
alteration in an encoded polypeptide) are an implied feature of every nucleic acid sequence
which encodes an amino acid. Similarly, "conservative amino acid substitutions," in which
one or a few amino acids in an amino acid sequence are substituted with different amino
acids with highly similar properties, are also readily identified as being highly similar to a
disclosed construct. Such conservative variations of each disclosed sequence are a feature
of the present invention.

Nucleic Acid Hybridization

Nucleic acids "hybridize" when they associate, typically in solution.
Nucleic acids hybridize due to a variety of well characterized physico-chemical forces,
such as hydrogen bonding, solvent exclusion, base stacking and the like. An extensive
guide to the hybridization of nucleic acids is found in Tijssen (1993) Laboratory
Techniques in Biochemistry and Molecular Biology: Hybridization with Nucleic Acid
Probes, part I, chapter 2, "Overview of principles of hybridization and the strategy of
nucleic acid probe assays," (Elsevier, New York), as well as in Ausubel, supra. Hames
and Higgins (1995) Gene Probes 1, IRL Press at Oxford University Press, Oxford,
England (Hames and Higgins 1) and Hames and Higgins (1995) Gene Probes 2, IRL Press
at Oxford University Press, Oxford, England (Hames and Higgins 2) provide details on the
synthesis, labeling, detection and quantification of DNA and RNA, including
oligonucleotides.

"Stringent hybridization wash conditions" in the context of nucleic acid
hybridization experiments, such as Southern and northern hybridizations, are sequence
dependent, and are different under different environmental parameters. An extensive
guide to the hybridization of nucleic acids is found in Tijssen (1993), supra, and in Hames
and Higgins 1 and Hames and Higgins 2, supra.

For purposes of the present invention, generally, "highly stringent"
hybridization and wash conditions are selected to be about 5°C or less lower than
the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH
(as noted below, highly stringent conditions can also be referred to in comparative terms).
The Tm is the temperature (under defined ionic strength and pH) at which 50% of the test
sequence hybridizes to a perfectly matched probe. Very stringent conditions are selected
to be equal to the Tm for a particular probe.

The Tm of a nucleic acid duplex indicates the temperature at which the
duplex is 50% denatured under the given conditions, and it represents a direct
measure of the stability of the nucleic acid hybrid. Thus, the Tm
corresponds to the temperature at the midpoint in the transition from helix to
random coil; it depends on length, nucleotide composition, and ionic strength for long
stretches of nucleotides.

After hybridization, unhybridized nucleic acid material can be removed by
a series of washes, the stringency of which can be adjusted depending upon the desired
results. Low stringency washing conditions (e.g., using higher salt and lower temperature)
increase sensitivity, but can produce nonspecific hybridization signals and high
background signals. Higher stringency conditions (e.g., using lower salt and a higher
temperature that is closer to the hybridization temperature) lower the background signal,
typically with only the specific signal remaining. See Rapley, R. and Walker, J.M. eds.,
Molecular Biomethods Handbook (Humana Press, Inc. 1998) (hereinafter "Rapley and
Walker"), which is incorporated herein by reference in its entirety for all purposes.
The Tm of a DNA-DNA duplex can be estimated using the following
equation (Equation 1):

$$T_m\,(^{\circ}\mathrm{C}) = 81.5^{\circ}\mathrm{C} + 16.6\,(\log_{10} M) + 0.41\,(\%G + C) - 0.72\,(\%f) - 500/n$$

where M is the molarity of the monovalent cations (usually Na+), (%G + C) is the
percentage of guanosine (G) and cytosine (C) nucleotides, (%f) is the
percentage of formamide and n is the number of nucleotide bases (i.e., length) of the hybrid.
See Rapley and Walker, supra.
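
The DNA-DNA estimate above is simple enough to compute directly. The following is a minimal sketch of that equation only; the function and parameter names are illustrative, (%f) is taken to be percent formamide, and the input values in the usage line are hypothetical rather than taken from the disclosure.

```python
import math


def tm_dna_dna(molarity, gc_percent, formamide_percent, length):
    """Estimate the Tm (degrees C) of a long DNA-DNA duplex.

    molarity          -- monovalent cation concentration M (mol/L), e.g. Na+
    gc_percent        -- (%G + C), as a percentage between 0 and 100
    formamide_percent -- (%f), percent formamide in the hybridization solution
    length            -- n, number of nucleotide bases in the hybrid
    """
    if molarity <= 0 or length <= 0:
        raise ValueError("molarity and length must be positive")
    return (81.5
            + 16.6 * math.log10(molarity)
            + 0.41 * gc_percent
            - 0.72 * formamide_percent
            - 500.0 / length)


# Hypothetical inputs: 0.1 M Na+, 50% G+C, no formamide, 500-base hybrid.
print(round(tm_dna_dna(0.1, 50.0, 0.0, 500), 1))  # 84.4
```

Raising the formamide percentage or lowering the salt concentration lowers the estimate, which is the basis for using those two variables to adjust stringency.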

The Tm of an RNA-DNA duplex can be estimated as follows (Equation 2):

$$T_m\,(^{\circ}\mathrm{C}) = 79.8^{\circ}\mathrm{C} + 18.5\,(\log_{10} M) + 0.58\,(\%G + C) - 11.8\,(\%G + C)^2 - 0.56\,(\%f) - 820/n$$

where M is the molarity of the monovalent cations (usually Na+), (%G + C) is the
percentage of guanosine (G) and cytosine (C) nucleotides, (%f) is the
percentage of formamide and n is the number of nucleotide bases (i.e., length) of the
hybrid. Id.

Equations 1 and 2 are typically accurate only for hybrid duplexes longer
than about 100-200 nucleotides. Id.

The Tm of nucleic acid sequences shorter than 50 nucleotides can be
calculated as follows:

$$T_m\,(^{\circ}\mathrm{C}) = 4\,(G + C) + 2\,(A + T)$$

where A (adenine), C, T (thymine), and G are the numbers of the
corresponding nucleotides.
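
The short-sequence rule above reduces to counting bases. The sketch below is illustrative only; tm_short_oligo is a hypothetical name, and the example input is the 15-nucleotide sequence quoted earlier in this section for the silent variation example.

```python
def tm_short_oligo(sequence):
    """Estimate Tm (degrees C) as 4(G + C) + 2(A + T) for sequences under 50 nt."""
    seq = sequence.replace(" ", "").upper().replace("U", "T")
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("sequence must contain only A, C, G, T (or U)")
    if len(seq) >= 50:
        raise ValueError("rule applies only to sequences shorter than 50 nucleotides")
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    return 4 * gc + 2 * at


# 15-mer with 8 G/C and 7 A/T bases: 4 * 8 + 2 * 7
print(tm_short_oligo("TGT GAT CTG CCT CAG"))  # 46
```
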
An example of stringent hybridization conditions for hybridization of
complementary nucleic acids which have more than 100 complementary residues on a
filter in a Southern or northern blot is 50% formamide with 1 mg of heparin at 42°C, with
the hybridization being carried out overnight. An example of stringent wash conditions is
a 0.2x SSC wash at 65°C for 15 minutes (see Sambrook, supra for a description of SSC
buffer). Often the high stringency wash is preceded by a low stringency wash to remove
background probe signal. An example low stringency wash is 2x SSC at 40°C for 15
minutes.

In general, a signal to noise ratio of 2.5x-5x (or higher) that observed
for an unrelated probe in the particular hybridization assay indicates detection of a specific
hybridization. Detection of at least stringent hybridization between two sequences in the
context of the present invention indicates relatively strong structural similarity or
homology to, e.g., the nucleic acids of the present invention provided in the sequence
listings herein.

As noted, "highly stringent" conditions are selected to be about 5°C or less
lower than the thermal melting point (Tm) for the specific sequence at a defined ionic
strength and pH. Target sequences that are closely related or identical to the nucleotide
sequence of interest (e.g., "probe") can be identified under highly stringent conditions.
Lower stringency conditions are appropriate for sequences that are less complementary.
See, e.g., Rapley and Walker, supra.

Comparative hybridization can be used to identify nucleic acids of the
invention, and this comparative hybridization method is a preferred method of
distinguishing nucleic acids of the invention. Detection of highly stringent hybridization
between two nucleotide sequences in the context of the present invention indicates
relatively strong structural similarity/homology to, e.g., the nucleic acids provided in the
sequence listing herein. Highly stringent hybridization between two nucleotide sequences
demonstrates a degree of similarity or homology of structure, nucleotide base composition,
arrangement or order that is greater than that detected by stringent hybridization
conditions. In particular, detection of highly stringent hybridization in the context of the
present invention indicates strong structural similarity or structural homology (e.g.,
nucleotide structure, base composition, arrangement or order) to, e.g., the nucleic acids
provided in the sequence listings herein. For example, it is desirable to identify test
nucleic acids which hybridize to the exemplar nucleic acids herein under stringent
conditions.

Thus, one measure of stringent hybridization is the ability to hybridize to
one of the listed nucleic acids (e.g., nucleic acid sequences SEQ ID NO:1 to SEQ ID
NO:35, and SEQ ID NO:72 to SEQ ID NO:78, and complementary polynucleotide
sequences thereof) under highly stringent conditions (or very stringent conditions, or
ultra-high stringency hybridization conditions, or ultra-ultra high stringency hybridization
conditions). Stringent hybridization (including, e.g., highly stringent, ultra-high
stringency, or ultra-ultra high stringency hybridization conditions) and wash conditions
can easily be determined empirically for any test nucleic acid.

For example, in determining highly stringent hybridization and wash
conditions, the stringency of the hybridization and wash conditions is gradually increased
(e.g., by increasing temperature, decreasing salt concentration, increasing detergent
concentration and/or increasing the concentration of organic solvents, such as formamide,
in the hybridization or wash), until a selected set of criteria are met. For example, the
stringency of the hybridization and wash conditions is gradually increased until a probe
comprising one or more nucleic acid sequences selected from SEQ ID NO:1 to SEQ ID
NO:35, SEQ ID NO:72 to SEQ ID NO:78, and complementary polynucleotide sequences
thereof, binds to a perfectly matched complementary target (again, a nucleic acid
comprising one or more nucleic acid sequences selected from SEQ ID NO:1 to SEQ ID
NO:35, SEQ ID NO:72 to SEQ ID NO:78, and complementary polynucleotide sequences
thereof), with a signal to noise ratio that is at least 2.5x, and optionally 5x or more, as
high as that observed for hybridization of the probe to an unmatched target. In this case,
the unmatched target is a nucleic acid corresponding to a known alpha interferon, e.g., an
alpha interferon nucleic acid that is present in a public database such as GenBank™ at the
time of filing of the subject application. Examples of such unmatched target nucleic acids
include, e.g., those with the following GenBank accession numbers: J00210 (alpha-D),
J00207 (Alpha-A), X02958 (Alpha-6), X02956 (Alpha-5), V00533 (alpha-H), V00542
(alpha-14), V00545 (IFN-1B), X03125 (alpha-8), X02957 (alpha-16), V00540 (alpha-21),
X02955 (alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961 (alpha-10 pseudogene),
R0067 (Gx-1), 101614, 101787, 107821, M12350 (alpha-F), M38289, V00549 (alpha-2a),
and 108313 (alpha-Con1). Additional such sequences can be identified in GenBank by one
of ordinary skill in the art. Nomenclature of the human interferon genes and proteins is
discussed in Diaz et al. (1996) J. Interferon and Cytokine Res. 16:179-180 and Allen et al.
(1996) J. Interferon and Cytokine Res. 16:181-184, respectively, each of which is
incorporated herein by reference in its entirety for all purposes.

A test nucleic acid is said to specifically hybridize to a probe nucleic acid
when it hybridizes at least 1/2 as well to the probe as to the perfectly matched
complementary target, i.e., with a signal to noise ratio at least 1/2 as high as hybridization
of the probe to the target under conditions in which the perfectly matched probe binds to
the perfectly matched complementary target with a signal to noise ratio that is at least
about 2.5x-10x, typically 5x-10x as high as that observed for hybridization to any of the
unmatched target nucleic acids represented by GenBank accession numbers J00210
(alpha-D), J00207 (Alpha-A), X02958 (Alpha-6), X02956 (Alpha-5), V00533 (alpha-H),
V00542 (alpha-14), V00545 (IFN-1B), X03125 (alpha-8), X02957 (alpha-16), V00540
(alpha-21), X02955 (alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961 (alpha-10
pseudogene), R0067 (Gx-1), 101614, 101787, 107821, M12350 (alpha-F), M38289,
V00549 (alpha-2a), and 108313 (alpha-Con1), or other similar interferon-alpha sequences
presented in GenBank.

Ultra high-stringency hybridization and wash conditions are those in which
the stringency of hybridization and wash conditions is increased until the signal to noise
ratio for binding of the probe to the perfectly matched complementary target nucleic acid
is at least 10x as high as that observed for hybridization to any of the unmatched target
nucleic acids represented by GenBank accession numbers J00210 (alpha-D), J00207
(Alpha-A), X02958 (Alpha-6), X02956 (Alpha-5), V00533 (alpha-H), V00542 (alpha-14),
V00545 (IFN-1B), X03125 (alpha-8), X02957 (alpha-16), V00540 (alpha-21), X02955
(alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961 (alpha-10 pseudogene), R0067
(Gx-1), 101614, 101787, 107821, M12350 (alpha-F), M38289, V00549 (alpha-2a), and
108313 (alpha-Con1), or other similar IFN-alpha sequences presented in GenBank. A target
nucleic acid which hybridizes to a probe under such conditions, with a signal to noise ratio
of at least 1/2 that of the perfectly matched complementary target nucleic acid, is said to
bind to the probe under ultra-high stringency conditions.

Similarly, even higher levels of stringency can be determined by gradually
increasing the stringency of the hybridization and/or wash conditions of the relevant
hybridization assay. For example, conditions in which the stringency of hybridization and
wash conditions is increased until the signal to noise ratio for binding of the probe to the
perfectly matched complementary target nucleic acid is at least 10x, 20x, 50x, 100x, or
500x or more as high as that observed for hybridization to any of the unmatched target
nucleic acids represented by GenBank accession numbers J00210 (alpha-D), J00207
(Alpha-A), X02958 (Alpha-6), X02956 (Alpha-5), V00533 (alpha-H), V00542 (alpha-14),
V00545 (IFN-1B), X03125 (alpha-8), X02957 (alpha-16), V00540 (alpha-21), X02955
(alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961 (alpha-10 pseudogene), R0067
(Gx-1), 101614, 101787, 107821, M12350 (alpha-F), M38289, V00549 (alpha-2a), and
108313 (alpha-Con1), or other similar interferon-alpha sequences presented in GenBank,
can be identified. A target nucleic acid which hybridizes to a probe under such conditions,
with a signal to noise ratio of at least 1/2 that of the perfectly matched complementary
target nucleic acid, is said to bind to the probe under ultra-ultra-high stringency conditions.
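
The comparative criteria in the preceding paragraphs combine two tests: the conditions must separate the perfectly matched target from the unmatched control by a chosen fold, and the test nucleic acid must give at least half the matched signal to noise ratio. The sketch below shows only that arithmetic; the function name, argument names, and numeric readings are hypothetical, and the fold chosen for the highest tier is left to the caller because the text lists several values.

```python
# Minimum fold by which the matched signal to noise ratio must exceed the
# unmatched control for the two tiers that the text fixes numerically.
HIGHLY_STRINGENT_FOLD = 2.5
ULTRA_HIGH_FOLD = 10.0


def hybridizes(test_sn, matched_sn, unmatched_sn, min_fold):
    """Apply the comparative hybridization criteria to three signal to noise ratios.

    test_sn      -- probe bound to the test nucleic acid
    matched_sn   -- probe bound to its perfectly matched complementary target
    unmatched_sn -- probe bound to the unmatched (known interferon-alpha) control
    min_fold     -- required ratio of matched_sn to unmatched_sn
    """
    if min(test_sn, matched_sn, unmatched_sn) <= 0:
        raise ValueError("signal to noise ratios must be positive")
    conditions_ok = matched_sn >= min_fold * unmatched_sn
    test_ok = test_sn >= 0.5 * matched_sn
    return conditions_ok and test_ok


# Hypothetical readings: matched 50, unmatched control 4, test nucleic acid 30.
print(hybridizes(30.0, 50.0, 4.0, HIGHLY_STRINGENT_FOLD))  # True
print(hybridizes(30.0, 50.0, 4.0, ULTRA_HIGH_FOLD))        # True
print(hybridizes(20.0, 50.0, 4.0, ULTRA_HIGH_FOLD))        # False
```

In the last call the conditions qualify as ultra-high stringency, but the test signal falls below half of the matched signal, so the test nucleic acid is not scored as binding under those conditions.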

Target nucleic acids which hybridize to the nucleic acids represented by
SEQ ID NO:1 to SEQ ID NO:35 and SEQ ID NO:72 to SEQ ID NO:78 under high,
ultra-high and ultra-ultra high stringency conditions are a feature of the invention. Examples
of such nucleic acids include those with one or a few silent or conservative nucleic acid
substitutions as compared to a given nucleic acid sequence.

Nucleic acids which do not hybridize to each other under stringent
conditions are still substantially identical if the polypeptides which they encode are
substantially identical. This occurs, e.g., when a copy of a nucleic acid is created using the
maximum codon degeneracy permitted by the genetic code, or when antisera or antiserum
generated against one or more of SEQ ID NO:36 to SEQ ID NO:70 and SEQ ID NO:79 to
SEQ ID NO:85 which has been subtracted using the polypeptides encoded by known
interferon-alpha sequences, including, e.g., those encoded by the following
interferon-alpha nucleic acid sequences in GenBank: Accession numbers J00210 (alpha-D),
J00207 (Alpha-A), X02958 (Alpha-6), X02956 (Alpha-5), V00533 (alpha-H), V00542
(alpha-14), V00545 (IFN-1B), X03125 (alpha-8), X02957 (alpha-16), V00540 (alpha-21),
X02955 (alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961 (alpha-10 pseudogene),
R0067 (Gx-1), 101614, 101787, 107821, M12350 (alpha-F), M38289, V00549 (alpha-2a),
and 108313 (alpha-Con1), or other similar interferon-alpha sequences presented in GenBank.
Further details on immunological identification of polypeptides of the invention are found
below. Additionally, for distinguishing between duplexes with sequences of less than
about 100 nucleotides, a TMACl hybridization procedure known to those of ordinary skill
in the art can be used. See, e.g., Sorg, U. et al. Nucleic Acids Res. (Sept. 11, 1991)
19(17), incorporated herein by reference in its entirety for all purposes.

In one aspect, the invention provides a nucleic acid which comprises a
unique subsequence in a nucleic acid selected from SEQ ID NO:1 to SEQ ID NO:35 or
SEQ ID NO:72 to SEQ ID NO:78. The unique subsequence is unique as compared to a
nucleic acid corresponding to any known interferon-alpha nucleic acid sequence
including, e.g., the known sequences represented by GenBank accession numbers J00210
(alpha-D), J00207 (Alpha-A), X02958 (Alpha-6), X02956 (Alpha-5), V00533 (alpha-H),
V00542 (alpha-14), V00545 (IFN-1B), X03125 (alpha-8), X02957 (alpha-16), V00540
(alpha-21), X02955 (alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961 (alpha-10
pseudogene), A12109, R0067 (Gx-1), 101614, 101787, 107821, M12350 (alpha-F),
M38289, V00549 (alpha-2a), and 108313 (alpha-Con1), or other similar interferon-alpha
sequences presented in GenBank. Such unique subsequences can be determined by
aligning any of SEQ ID NO:1 to SEQ ID NO:35 or SEQ ID NO:72 to SEQ ID NO:78
against the complete set of nucleic acids corresponding to GenBank accession numbers of
known interferon-alpha nucleic acid sequences, such as, e.g., J00210 (alpha-D), J00207
(Alpha-A), X02958 (Alpha-6), X02956 (Alpha-5), V00533 (alpha-H), V00542 (alpha-14),
V00545 (IFN-1B), X03125 (alpha-8), X02957 (alpha-16), V00540 (alpha-21), X02955
(alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961 (alpha-10 pseudogene),
A12109 (alpha-4B), R0067 (Gx-1), 101614, 101787, 107821, M12350 (alpha-F),
M38289, V00549 (alpha-2a), and 108313 (alpha-Con1), or other similar interferon-alpha
sequences presented in GenBank. Alignment can be performed using the BLAST
algorithm set to default parameters. Any unique subsequence is useful, e.g., as a probe to
identify the nucleic acids of the invention.
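
By way of illustration only, the comparison described above can be sketched in a few lines
of Python. The sketch below is not the BLAST procedure named in the text: it substitutes a
simple exact-match scan over fixed-length windows, and the function name, the window
length k, and the example sequences are all hypothetical choices made for the example.

```python
def unique_subsequences(candidate, known_sequences, k=20):
    """Return (offset, subsequence) pairs for every length-k window of
    `candidate` that occurs in none of `known_sequences`.

    Exact string matching is used here as a simplified stand-in for an
    alignment-based comparison; k is an arbitrary illustrative length.
    """
    candidate = candidate.upper()
    known = [s.upper() for s in known_sequences]
    hits = []
    for i in range(len(candidate) - k + 1):
        window = candidate[i:i + k]
        # keep the window only if no known sequence contains it
        if not any(window in s for s in known):
            hits.append((i, window))
    return hits


if __name__ == "__main__":
    # toy sequences, not real interferon-alpha sequences
    print(unique_subsequences("ACGTACGTTT", ["ACGTACGA"], k=4))
```

In practice the list of known sequences would be the set of GenBank entries recited above,
and an alignment tool would also catch near matches that this exact-match scan ignores.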

Similarly, the invention includes a polypeptide which comprises a unique
amino acid subsequence in a polypeptide selected from: SEQ ID NO:36 to SEQ ID NO:70
or SEQ ID NO:79 to SEQ ID NO:85. Here, the unique subsequence is unique as
compared to an amino acid subsequence of a known interferon-alpha polypeptide
including, e.g., an amino acid subsequence of a polypeptide encoded by a known
interferon-alpha nucleic acid corresponding to any of GenBank accession numbers:
J00210 (alpha-D), J00207 (Alpha-A), X02958 (Alpha-6), X02956 (Alpha-5), V00533
(alpha-H), V00542 (alpha-14), V00545 (IFN-1B), X03125 (alpha-8), X02957 (alpha-16),
V00540 (alpha-21), X02955 (alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961
(alpha-10 pseudogene), R0067 (Gx-1), 101614, 101787, 107821, M12350 (alpha-F),
M38289, V00549 (alpha-2a), and 108313 (alpha-Con1), or other similar interferon-alpha
nucleic acid or polypeptide sequences presented in GenBank. Here again, the polypeptide
is aligned against the complete set of known interferon-alpha polypeptide sequences, such
as those polypeptides encoded by nucleic acids corresponding to GenBank accession
numbers J00210 (alpha-D), J00207 (Alpha-A), X02958 (Alpha-6), X02956 (Alpha-5),
V00533 (alpha-H), V00542 (alpha-14), V00545 (IFN-1B), X03125 (alpha-8), X02957
(alpha-16), V00540 (alpha-21), X02955 (alpha-4b), V00532 (alpha-C), X02960 (alpha-7),
X02961 (alpha-10 pseudogene), R0067 (Gx-1), 101614, 101787, 107821, M12350
(alpha-F), M38289, V00549 (alpha-2a), and 108313 (alpha-Con1), (referred to as "the control
polypeptides") (note that where the sequence corresponds to a non-translated sequence
such as a pseudogene, the corresponding polypeptide is generated simply by in silico
translation of the nucleic acid sequence into an amino acid sequence, where the reading
frame is selected to correspond to the reading frame of homologous alpha interferon
nucleic acids) or other similar interferon-alpha nucleic acid or polypeptide sequences
presented in GenBank.
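
The in silico translation mentioned above can be illustrated with a short Python sketch.
It assumes the standard genetic code and a reading frame supplied by the caller (e.g., one
chosen by comparison with homologous alpha interferon nucleic acids); the names
CODON_TABLE and translate are hypothetical and introduced only for this example.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nucleic_acid, frame=0):
    """Translate a nucleic acid string in the given reading frame (0, 1 or 2).

    Stop codons are written as '*', codons containing characters other
    than A, C, G, T/U as 'X'; trailing bases that do not fill a codon are
    ignored.
    """
    seq = nucleic_acid.upper().replace("U", "T")
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


if __name__ == "__main__":
    print(translate("ATGTGTGATCTG"))           # frame 0
    print(translate("ATGTGTGATCTG", frame=1))  # frame 1
```

The translated string can then be compared against the control polypeptides in the same
way as any other polypeptide sequence.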

In addition, the present invention provides a target nucleic acid which
hybridizes under at least stringent or highly stringent conditions (or conditions of greater
stringency) to a unique coding oligonucleotide which encodes a unique subsequence in a
polypeptide selected from: SEQ ID NO:36 to SEQ ID NO:70 or SEQ ID NO:79 to SEQ
ID NO:85, wherein the unique subsequence is unique as compared to an amino acid
subsequence of a known interferon-alpha polypeptide sequence shown in GenBank or to a
polypeptide corresponding to any of the control polypeptides. Unique sequences are
determined as noted above.

In one example, the stringent conditions are selected such that a perfectly
complementary oligonucleotide to the coding oligonucleotide hybridizes to the coding
oligonucleotide with at least about a 5-10x higher signal to noise ratio than for
hybridization of the perfectly complementary oligonucleotide to a control nucleic acid
corresponding to any of the control polypeptides. Conditions can be selected such that
higher ratios of signal to noise are observed in the particular assay which is used, e.g.,
about 15x, 20x, 30x, 50x or more. In this example, the target nucleic acid hybridizes to
the unique coding oligonucleotide with at least a 2x higher signal to noise ratio as
compared to hybridization of the control nucleic acid to the coding oligonucleotide.
Again, higher signal to noise ratios can be selected, e.g., about 2.5x, about 5x, about 10x,
about 20x, about 30x, about 50x or more. The particular signal will depend on the label
used in the relevant assay, e.g., a fluorescent label, a colorimetric label, a radioactive
label, or the like.
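
The fold comparisons recited above amount to simple arithmetic, sketched here in Python
under stated assumptions: signal and noise are taken to be positive readings in the same
arbitrary units for whatever label is used, and the function names and example values are
hypothetical rather than measured data.

```python
def signal_to_noise(signal, noise):
    """Signal to noise ratio for a single hybridization reading."""
    if noise <= 0:
        raise ValueError("noise must be positive")
    return signal / noise


def fold_over_control(target_signal, target_noise, control_signal, control_noise):
    """How many times higher the target's signal to noise ratio is than
    the control's."""
    return (signal_to_noise(target_signal, target_noise)
            / signal_to_noise(control_signal, control_noise))


def meets_threshold(fold, min_fold=2.0):
    """True if the observed fold difference reaches the chosen minimum
    (e.g., 2x, 5x, 10x)."""
    return fold >= min_fold


if __name__ == "__main__":
    # illustrative numbers only
    fold = fold_over_control(1200.0, 100.0, 300.0, 100.0)
    print(fold, meets_threshold(fold, 2.0), meets_threshold(fold, 5.0))
```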

In another aspect, the invention provides a polypeptide that comprises a
unique subsequence in a polypeptide selected from SEQ ID NO:36 to SEQ ID NO:70 and
SEQ ID NO:79 to SEQ ID NO:85, wherein the unique subsequence is unique as compared
to a polypeptide sequence corresponding to a known interferon-alpha polypeptide, such as,
e.g., an interferon-alpha polypeptide sequence present in GenBank.

SUBSTRATES AND FORMATS FOR SEQUENCE RECOMBINATION 

The polynucleotides of the invention are useful as substrates for a variety of
recombination and recursive recombination (e.g., DNA shuffling) reactions, as well as
other diversity generating techniques, including mutagenesis techniques and standard
cloning methods as set forth in, e.g., Ausubel, Berger and Sambrook, supra, i.e., to
produce additional interferon-alpha homologues with desired properties. Based on the
screening or selection protocols employed, recombinant, e.g., shuffled, interferon-alpha
homologue polypeptides can be generated and isolated that confer a variety of desirable
characteristics, e.g., enhanced antiviral activity, enhanced antiproliferative activity,
increased growth inhibitory, cytostatic and/or cytotoxic activities towards particular target
cells, reduced immunogenicity, etc.

A variety of diversity generating protocols, including nucleic acid shuffling
protocols, are available and fully described in the art. The procedures can be used
separately, and/or in combination to produce one or more variants of a nucleic acid or set
of nucleic acids, as well as variants of encoded proteins. Individually and collectively, these
procedures provide robust, widely applicable ways of generating diversified nucleic acids
and sets of nucleic acids (including, e.g., nucleic acid libraries) useful, e.g., for the
engineering or rapid evolution of nucleic acids, proteins, pathways, cells and/or organisms
with new and/or improved characteristics.

While distinctions and classifications are made in the course of the ensuing
discussion for clarity, it will be appreciated that the techniques are often not mutually
exclusive. Indeed, the various methods can be used singly or in combination, in parallel or
in series, to access diverse sequence variants.

The result of any of the diversity generating procedures described herein
can be the generation of one or more nucleic acids, which can be selected or screened for
nucleic acids that encode proteins with or which confer desirable properties. Following
diversification by one or more of the methods herein, or otherwise available to one of skill,
any nucleic acids that are produced can be selected for a desired activity or property, e.g.,
enhanced antiviral activity, enhanced antiproliferative activity, enhanced anti-angiogenic
activity, increased growth inhibitory, cytostatic and/or cytotoxic activities towards
particular target cells, reduced immunogenicity, etc. Methods for determining nucleic
acids having enhanced antiviral, antiproliferative, growth inhibitory, cytostatic, and/or
cytotoxic activity or reduced immunogenicity include those described herein. This can
include identifying any activity that can be detected, for example, in an automated or
automatable format, by any of the assays in the art. A variety of related (or even
unrelated) properties can be evaluated, in serial or in parallel, at the discretion of the
practitioner.

The following publications and the references cited therein describe a variety of
diversity generating procedures, including recursive recombination procedures, and/or
methods for generating modified nucleic acid sequences for use in the procedures and
methods of the present invention: Soong, N. W. et al. (2000) "Molecular Breeding of
Viruses," Nature Genetics 25:436-439; Stemmer, W. et al. (1999) "Molecular breeding of
viruses for targeting and other clinical properties," Tumor Targeting 4:1-4; Ness et al.
(1999) "DNA Shuffling of subgenomic sequences of subtilisin," Nature Biotechnology
17:893-896; Chang et al. (1999) "Evolution of a cytokine using DNA family shuffling,"
Nature Biotechnology 17:793-797; Minshull and Stemmer (1999) "Protein evolution by
molecular breeding," Current Opinion in Chemical Biology 3:284-290; Christians et al.
(1999) "Directed evolution of thymidine kinase for AZT phosphorylation using DNA
family shuffling," Nature Biotechnology 17:259-264; Crameri et al. (1998) "DNA
shuffling of a family of genes from diverse species accelerates directed evolution," Nature
391:288-291; Crameri et al. (1997) "Molecular evolution of an arsenate detoxification
pathway by DNA shuffling," Nature Biotechnology 15:436-438; Zhang et al. (1997)
"Directed evolution of an effective fucosidase from a galactosidase by DNA shuffling and
screening," Proc. Natl. Acad. Sci. USA 94:4504-4509; Patten et al. (1997) "Applications of
DNA Shuffling to Pharmaceuticals and Vaccines," Current Opinion in Biotechnology
8:724-733; Crameri et al. (1996) "Construction and evolution of antibody-phage libraries
by DNA shuffling," Nature Medicine 2:100-103; Crameri et al. (1996) "Improved green
fluorescent protein by molecular evolution using DNA shuffling," Nature Biotechnology
14:315-319; Gates et al. (1996) "Affinity selective isolation of ligands from peptide
libraries through display on a lac repressor 'headpiece dimer,'" J. Mol. Biol. 255:373-386;
Stemmer (1996) "Sexual PCR and Assembly PCR" In: The Encyclopedia of Molecular
Biology. VCH Publishers, New York. pp. 447-457; Crameri and Stemmer (1995)
"Combinatorial multiple cassette mutagenesis creates all the permutations of mutant and
wildtype cassettes," BioTechniques 18:194-195; Stemmer et al. (1995) "Single-step
assembly of a gene and entire plasmid from large numbers of oligodeoxyribonucleotides,"
Gene 164:49-53; Stemmer (1995) "The Evolution of Molecular Computation," Science
270:1510; Stemmer (1995) "Searching Sequence Space," Bio/Technology 13:549-553;
Stemmer (1994) "Rapid evolution of a protein in vitro by DNA shuffling," Nature
370:389-391; and Stemmer (1994) "DNA shuffling by random fragmentation and
reassembly: In vitro recombination for molecular evolution," Proc. Natl. Acad. Sci. USA
91:10747-10751.

Additional details regarding DNA shuffling and other diversity generating
methods can be found in the following U.S. patents, PCT publications, and EP
publications: USPN 5,605,793 to Stemmer (February 25, 1997), "Methods for In vitro
Recombination;" USPN 5,811,238 to Stemmer et al. (September 22, 1998), "Methods for
Generating Polynucleotides having Desired Characteristics by Iterative Selection and
Recombination;" USPN 5,830,721 to Stemmer et al. (November 3, 1998), "DNA
Mutagenesis by Random Fragmentation and Reassembly;" USPN 5,834,252 to Stemmer
(November 10, 1998), "End-Complementary Polymerase Reaction;" USPN 5,837,458 to
Minshull (November 17, 1998), "Methods and Compositions for Cellular and Metabolic
Engineering;" WO 95/22625, Stemmer and Crameri, "Mutagenesis by Random
Fragmentation and Reassembly;" WO 96/33207 by Stemmer and Lipschutz, "End
Complementary Polymerase Chain Reaction;" WO 97/20078 by Stemmer and Crameri,
"Methods for Generating Polynucleotides having Desired Characteristics by Iterative
Selection and Recombination;" WO 97/35966 by Minshull and Stemmer, "Methods and
Compositions for Cellular and Metabolic Engineering;" WO 99/41402 by Punnonen et al.,
"Targeting of Genetic Vaccine Vectors;" WO 99/41383 by Punnonen et al., "Antigen
Library Immunization;" WO 99/41369 by Punnonen et al., "Genetic Vaccine Vector
Engineering;" WO 99/41368 by Punnonen et al., "Optimization of Immunomodulatory
Properties of Genetic Vaccines;" EP 752008 by Stemmer and Crameri, "DNA
Mutagenesis by Random Fragmentation and Reassembly;" EP 0932670 by Stemmer,
"Evolving Cellular DNA Uptake by Recursive Sequence Recombination;" WO 99/23107
by Stemmer et al., "Modification of Virus Tropism and Host Range by Viral Genome
Shuffling;" WO 99/21979 by Apt et al., "Human Papillomavirus Vectors;" WO 98/31837
by Del Cardayre et al., "Evolution of Whole Cells and Organisms by Recursive Sequence
Recombination;" WO 98/27230 by Patten and Stemmer, "Methods and Compositions for
Polypeptide Engineering;" EP 0946755 by Patten and Stemmer, "Methods and
Compositions for Polypeptide Engineering;" WO 98/13487 by Stemmer et al.,
"Methods for Optimization of Gene Therapy by Recursive Sequence Shuffling and
Selection;" WO 00/00632, "Methods for Generating Highly Diverse Libraries;" WO
00/09679, "Methods for Obtaining in vitro Recombined Polynucleotide Sequence Banks
and Resulting Sequences;" WO 98/42832 by Arnold et al., "Recombination of
Polynucleotide Sequences Using Random or Defined Primers;" WO 99/29902 by Arnold
et al., "Method for Creating Polynucleotide and Polypeptide Sequences;" WO 98/41653
by Vind, "An in vitro Method for Construction of a DNA Library;" WO 98/41622 by
Borchert et al., "Method for Constructing a Library Using DNA Shuffling;" and WO
98/42727 by Pati and Zarling, "Sequence Alterations using Homologous Recombination."

Certain U.S. applications provide additional details regarding DNA
shuffling and related techniques, as well as other diversity generating methods, including
"SHUFFLING OF CODON ALTERED GENES" by Patten et al., filed September 29,
1998 (USSN 60/102,362), January 29, 1999 (USSN 60/117,729), and September 28, 1999
(USSN 09/407,800); "EVOLUTION OF WHOLE CELLS AND ORGANISMS BY
RECURSIVE SEQUENCE RECOMBINATION" by Del Cardayre et al., filed July 15,
1998 (USSN 09/166,188), and July 15, 1999 (USSN 09/354,922);
"OLIGONUCLEOTIDE MEDIATED NUCLEIC ACID RECOMBINATION" by
Crameri et al., filed February 5, 1999 (USSN 60/118,813), June 24, 1999 (USSN
60/141,049), and September 28, 1999 (USSN 09/408,392); "USE OF CODON-BASED
OLIGONUCLEOTIDE SYNTHESIS FOR SYNTHETIC SHUFFLING" by Welch et al.,
filed September 28, 1999 (USSN 09/408,393); "METHODS FOR MAKING
CHARACTER STRINGS, POLYNUCLEOTIDES & POLYPEPTIDES HAVING
DESIRED CHARACTERISTICS" by Selifonov and Stemmer, filed February 5, 1999
(USSN 60/118,854) and October 12, 1999 (USSN 09/416,375); "RECOMBINATION OF
INSERTION MODIFIED NUCLEIC ACIDS" by Patten et al., filed March 5, 1999 (USSN
60/122,943), July 2, 1999 (USSN 60/142,299), November 10, 1999 (USSN 60/164,618),
and November 10, 1999 (USSN 60/164,617); and "SINGLE-STRANDED NUCLEIC
ACID TEMPLATE-MEDIATED RECOMBINATION AND NUCLEIC ACID
FRAGMENT ISOLATION" by Affholter, USSN 60/186,482, filed March 2, 2000.

As a review of the foregoing publications, patents, published foreign
applications and U.S. patent applications reveals, diversity generation methods, such as
shuffling (or "recursive recombination") of nucleic acids, to provide new nucleic acids
with desired properties can be carried out by a number of established methods. Any of
these methods can be adapted to the present invention to evolve the alpha interferons
discussed herein to produce new alpha interferon homologues with new or improved
properties. Both the methods of making such interferons and the interferons (e.g., IFN
homologues) produced by these methods are a feature of the invention.

In brief, several different general classes of sequence modification methods, such as
recombination, are applicable to the present invention and set forth, e.g., in the references
above. First, nucleic acids can be recombined in vitro by any of a variety of techniques
discussed in the references above, including e.g., DNAse digestion of nucleic acids to be
recombined followed by ligation and/or PCR reassembly of the nucleic acids. Second,
nucleic acids can be recursively recombined in vivo or ex vivo, e.g., by allowing
recombination to occur between nucleic acids in cells. Third, whole genome
recombination methods can be used in which whole genomes of cells or other organisms
are recombined, optionally including spiking of the genomic recombination mixtures with
desired library components (e.g., genes corresponding to the pathways of the present
invention). Fourth, synthetic recombination methods can be used, in which
oligonucleotides corresponding to targets of interest are synthesized and reassembled in
PCR or ligation reactions which include oligonucleotides which correspond to more than
one parental nucleic acid, thereby generating new recombined nucleic acids.
Oligonucleotides can be made by standard nucleotide addition methods, or can be made,
e.g., by tri-nucleotide synthetic approaches. Fifth, in silico methods of recombination can
be effected in which genetic algorithms are used in a computer to recombine sequence
strings which correspond to homologous (or even non-homologous) nucleic acids. The
resulting recombined sequence strings are optionally converted into nucleic acids by
synthesis of nucleic acids which correspond to the recombined sequences, e.g., in concert
with oligonucleotide synthesis/gene reassembly techniques. Any of the preceding general
recombination formats can be practiced in a reiterative fashion to generate a more diverse
set of recombinant nucleic acids. Sixth, methods of accessing natural diversity, e.g., by
hybridization of diverse nucleic acids or nucleic acid fragments to single-stranded
templates, followed by polymerization and/or ligation to regenerate full-length sequences,
optionally followed by degradation of the templates and recovery of the resulting modified
nucleic acids, can be used. The above references provide these and other basic
recombination formats as well as many modifications of these formats. Regardless of the
format which is used, the nucleic acids of the invention can be recombined (with each
other, or with related (or even unrelated) nucleic acids) to produce a diverse set of
recombinant nucleic acids, including e.g., homologous nucleic acids. In general, the
sequence recombination techniques described herein provide particular advantages in that
they provide for recombination between the nucleic acids of SEQ ID NO:1 to SEQ ID
NO:35, and SEQ ID NO:72 to SEQ ID NO:78, or fragments or variants thereof, in any
available format, thereby providing a very fast way of exploring the manner in which
different combinations of sequences can affect a desired result.
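
As a rough illustration of the in silico format noted above, the following Python sketch
recombines pre-aligned parental sequence strings at randomly chosen crossover points. It is
a minimal toy model and not the method of any cited reference: the function name, the
requirement that parents have equal length, and the choice to switch to a different parent
at every crossover are all assumptions made for the example.

```python
import random


def recombine(parents, n_crossovers=2, rng=None):
    """Build one chimeric sequence string from pre-aligned parent strings.

    Crossover points are drawn at random; at each point the source of the
    next segment switches to a different, randomly chosen parent.
    """
    if rng is None:
        rng = random.Random()
    if len(parents) < 2:
        raise ValueError("at least two parents are required")
    length = len(parents[0])
    if any(len(p) != length for p in parents):
        raise ValueError("parents must be pre-aligned to equal length")
    if not 0 < n_crossovers < length:
        raise ValueError("n_crossovers must be between 1 and length - 1")

    points = sorted(rng.sample(range(1, length), n_crossovers))
    bounds = [0] + points + [length]
    current = rng.randrange(len(parents))
    segments = []
    for start, end in zip(bounds, bounds[1:]):
        segments.append(parents[current][start:end])
        # pick a different parent for the next segment
        current = rng.choice([i for i in range(len(parents)) if i != current])
    return "".join(segments)


if __name__ == "__main__":
    # toy parents; real inputs would be aligned homologue sequences
    toy = ["AAAAAAAAAA", "CCCCCCCCCC", "GGGGGGGGGG"]
    print(recombine(toy, n_crossovers=3, rng=random.Random(0)))
```

Calling the function repeatedly, and feeding selected outputs back in as parents, mimics
the reiterative use of a recombination format described in the text.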

Following recombination, any nucleic acids which are produced can be
screened or selected for a desired activity. In the context of the present invention, this can
include testing for and identifying any activity that can be detected, e.g., in an automatable
format, by any assay known in the art. In addition, useful properties such as low
immunogenicity, increased half-life, improved solubility, oral availability, or the like can
also be selected for. A variety of alpha-interferon related (or even unrelated) properties
can be assayed for, using any available assay.

DNA mutagenesis and shuffling provide a robust, widely applicable means
of generating diversity useful for the engineering of proteins, pathways, cells and
organisms with improved characteristics. In addition to the basic formats described above,
it is sometimes desirable to combine shuffling methodologies with other techniques for
generating diversity. In conjunction with (or separately from) shuffling methods, a variety
of diversity generation methods can be practiced and the results (i.e., diverse populations
of nucleic acids) screened for in the systems of the invention. Additional diversity can be
introduced by methods which result in the alteration of individual nucleotides or groups of
contiguous or non-contiguous nucleotides, i.e., mutagenesis methods. Many mutagenesis
methods are found in the above-cited references; additional details regarding mutagenesis
methods can be found in the references listed below.

Mutagenesis methods of generating diversity include, for example,
recombination (PCT/US98/05223; Publ. No. WO 98/42727); site-directed mutagenesis
(Ling et al. (1997) "Approaches to DNA mutagenesis: an overview," Anal. Biochem.
254(2):157-178; Dale et al. (1996) "Oligonucleotide-directed random mutagenesis using
the phosphorothioate method," Methods Mol. Biol. 57:369-374; Smith (1985) "In vitro
mutagenesis," Ann. Rev. Genet. 19:423-462; Botstein & Shortle (1985) "Strategies and
applications of in vitro mutagenesis," Science 229:1193-1201; Carter (1986)
"Site-directed mutagenesis," Biochem. J. 237:1-7; and Kunkel (1987) "The efficiency of
oligonucleotide directed mutagenesis," in Nucleic Acids & Molecular Biology (Eckstein,
F. and Lilley, D.M.J., eds., Springer Verlag, Berlin)); mutagenesis using uracil containing
templates (Kunkel (1985) "Rapid and efficient site-specific mutagenesis without
phenotypic selection," Proc. Nat'l Acad. Sci. USA 82:488-492; Kunkel et al. (1987)
"Rapid and efficient site-specific mutagenesis without phenotypic selection," Methods
in Enzymol. 154:367-382; and Bass et al. (1988) "Mutant Trp repressors with new
DNA-binding specificities," Science 242:240-245); oligonucleotide-directed mutagenesis
(Methods in Enzymol. 100:468-500 (1983); Methods in Enzymol. 154:329-350
(1987); Zoller & Smith (1982) "Oligonucleotide-directed mutagenesis using M13-derived
vectors: an efficient and general procedure for the production of point mutations in any
DNA fragment," Nucleic Acids Res. 10:6487-6500; Zoller & Smith (1983)
"Oligonucleotide-directed mutagenesis of DNA fragments cloned into M13 vectors,"
Methods in Enzymol. 100:468-500; and Zoller & Smith (1987)
"Oligonucleotide-directed mutagenesis: a simple method using two oligonucleotide
primers and a single-stranded DNA template," Methods in Enzymol. 154:329-350);
phosphorothioate-modified DNA mutagenesis (Taylor et al. (1985) "The use of
phosphorothioate-modified DNA in restriction enzyme reactions to prepare nicked DNA,"
Nucl. Acids Res. 13:8749-8764; Taylor et al. (1985) "The rapid generation of
oligonucleotide-directed mutations at high frequency using phosphorothioate-modified
DNA," Nucl. Acids Res. 13:8765-8787; Nakamaye & Eckstein (1986) "Inhibition of
restriction endonuclease Nci I cleavage by phosphorothioate groups and its application to
oligonucleotide-directed mutagenesis," Nucl. Acids Res. 14:9679-9698; Sayers et al.
(1988) "5'-3' Exonucleases in phosphorothioate-based oligonucleotide-directed
mutagenesis," Nucl. Acids Res. 16:791-802; and Sayers et al. (1988) "Strand specific
cleavage of phosphorothioate-containing DNA by reaction with restriction endonucleases
in the presence of ethidium bromide," Nucl. Acids Res. 16:803-814); mutagenesis using
gapped duplex DNA (Kramer et al. (1984) "The gapped duplex DNA approach to
oligonucleotide-directed mutation construction," Nucl. Acids Res. 12:9441-9456; Kramer
& Fritz (1987) "Oligonucleotide-directed construction of mutations via gapped duplex
DNA," Methods in Enzymol. 154:350-367; Kramer et al. (1988) "Improved enzymatic in
vitro reactions in the gapped duplex DNA approach to oligonucleotide-directed
construction of mutations," Nucl. Acids Res. 16:7207; and Fritz et al. (1988)
"Oligonucleotide-directed construction of mutations: a gapped duplex DNA procedure
without enzymatic reactions in vitro," Nucl. Acids Res. 16:6987-6999).

Additional suitable methods include point mismatch repair (Kramer et al.
(1984) "Point Mismatch Repair," Cell 38:879-887), mutagenesis using repair-deficient
host strains (Carter et al. (1985) "Improved oligonucleotide site-directed mutagenesis
using M13 vectors," Nucl. Acids Res. 13:4431-4443; and Carter (1987) "Improved
oligonucleotide-directed mutagenesis using M13 vectors," Methods in Enzymol.
154:382-403), deletion mutagenesis (Eghtedarzadeh & Henikoff (1986) "Use of
oligonucleotides to generate large deletions," Nucl. Acids Res. 14:5115),
restriction-selection and restriction-purification (Wells et al. (1986)
"Importance of hydrogen-bond formation in stabilizing the transition state of subtilisin,"
Phil. Trans. R. Soc. Lond. A 317:415-423), mutagenesis by total gene synthesis (Nambiar
et al. (1984) "Total synthesis and cloning of a gene coding for the ribonuclease S protein,"
Science 223:1299-1301; Sakamar and Khorana (1988) "Total synthesis and expression of
a gene for the a-subunit of bovine rod outer segment guanine nucleotide-binding protein
(transducin)," Nucl. Acids Res. 14:6361-6372; Wells et al. (1985) "Cassette mutagenesis:
an efficient method for generation of multiple mutations at defined sites," Gene
34:315-323; and Grundstrom et al. (1985) "Oligonucleotide-directed mutagenesis by
microscale 'shot-gun' gene synthesis," Nucl. Acids Res. 13:3305-3316), and double-strand
break repair (Mandecki (1986) "Oligonucleotide-directed double-strand break repair in
plasmids of Escherichia coli: a method for site-specific mutagenesis," Proc. Nat'l Acad.
Sci. USA 83:7177-7181). Additional details on many of the above methods can be found in
Methods in Enzymology, Vol. 154, which also describes useful controls for
troubleshooting problems with various mutagenesis methods.

Random or semi-random mutagenesis using doped or degenerate
oligonucleotides (Arkin and Youvan (1992) "Optimizing nucleotide mixtures to encode
specific subsets of amino acids for semi-random mutagenesis," Biotechnology
10:297-300; Reidhaar-Olson et al. (1991) "Random mutagenesis of protein sequences using
oligonucleotide cassettes," Methods Enzymol. 208:564-86; Lim and Sauer (1991) "The
role of internal packing interactions in determining the structure and stability of a protein,"
J. Mol. Biol. 219:359-76; Breyer and Sauer (1989) "Mutational analysis of the fine
specificity of binding of monoclonal antibody 51F to lambda repressor," J. Biol. Chem.
264:13355-60); "Walk-Through Mutagenesis" (Crea, R.; US Patents 5,830,650 and
5,798,208, and EP Patent 0527809 B1) may also be employed to generate diversity.

In one aspect of the present invention, error-prone PCR can be used to
generate nucleic acid variants. Using this technique, PCR is performed under conditions
where the copying fidelity of the DNA polymerase is low, such that a high rate of point
mutations is obtained along the entire length of the PCR product. Examples of such
techniques are found in the references above and, e.g., in Leung et al. (1989) Technique
1:11-15 and Caldwell et al. (1992) PCR Methods Applic. 2:28-33. Similarly, assembly
PCR can be used, in a process which involves the assembly of a PCR product from a
mixture of small DNA fragments. A large number of different PCR reactions can occur in
parallel in the same vial, with the products of one reaction priming the products of another
reaction. Sexual PCR mutagenesis can be used in which homologous recombination
occurs between DNA molecules of different but related DNA sequence in vitro, by
random fragmentation of the DNA molecule based on sequence homology, followed by
fixation of the crossover by primer extension in a PCR reaction. This process is described
in the references above, e.g., in Stemmer (1994) Proc. Nat'l Acad. Sci. USA
91:10747-10751. Recursive ensemble mutagenesis can be used in which an algorithm for
protein mutagenesis is used to produce diverse populations of phenotypically related
mutants whose members differ in amino acid sequence. This method uses a feedback
mechanism to control successive rounds of combinatorial cassette mutagenesis. Examples
of this approach are found in Arkin & Youvan (1992) Proc. Nat'l Acad. Sci. USA 89:7811-7815.
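
The effect of low copying fidelity in error-prone PCR can be pictured with a small Python
simulation. This is a deliberately simplified sketch: it assumes a uniform per-base
substitution probability and follows a single lineage through a number of copying cycles,
whereas real polymerases show sequence-dependent and biased error spectra. The function
names, the default error rate and the cycle count are hypothetical.

```python
import random


def low_fidelity_copy(template, error_rate, rng):
    """Copy a DNA string, substituting each base with probability
    `error_rate` by one of the three other bases."""
    copied = []
    for base in template.upper():
        if base in "ACGT" and rng.random() < error_rate:
            copied.append(rng.choice([b for b in "ACGT" if b != base]))
        else:
            copied.append(base)
    return "".join(copied)


def error_prone_library(template, n_variants, cycles=10, error_rate=0.001, rng=None):
    """Return `n_variants` sequences, each the result of `cycles`
    successive low-fidelity copies of `template`."""
    if rng is None:
        rng = random.Random()
    library = []
    for _ in range(n_variants):
        seq = template
        for _ in range(cycles):
            seq = low_fidelity_copy(seq, error_rate, rng)
        library.append(seq)
    return library


if __name__ == "__main__":
    # toy template and an exaggerated error rate, for display only
    for variant in error_prone_library("ATGGCTAGCTTAGGC", 3, error_rate=0.02,
                                       rng=random.Random(1)):
        print(variant)
```

Point mutations accumulate along the whole length of each copy, which is the property the
text relies on; the resulting variants would then be screened or selected as described
elsewhere herein.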

As noted, oligonucleotide directed mutagenesis can be used in a process
which allows for the generation of site-specific mutations in any nucleic acid sequence of
interest. Examples of such techniques are found in the references above and, e.g., in
Reidhaar-Olson et al. (1988) Science 241:53-57. Similarly, cassette mutagenesis can be
used in a process which replaces a small region of a double stranded DNA molecule with a
synthetic oligonucleotide cassette that differs from the native sequence. The
oligonucleotide can contain, e.g., completely and/or partially randomized native
sequence(s).
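
Cassette mutagenesis as described above can be sketched at the sequence-string level in
Python. The example assumes, purely for illustration, that the cassette is randomized with
NNK codons (N = any base, K = G or T), a commonly used degenerate scheme that is not
recited in the text; the function names and coordinates are likewise hypothetical.

```python
import random


def randomized_cassette(n_codons, rng):
    """Return n_codons NNK codons: any base at positions 1 and 2,
    G or T at position 3."""
    return "".join(
        rng.choice("ACGT") + rng.choice("ACGT") + rng.choice("GT")
        for _ in range(n_codons)
    )


def cassette_mutagenesis(sequence, start, n_codons, rng=None):
    """Replace 3 * n_codons bases of `sequence`, beginning at 0-based
    offset `start`, with a randomized cassette of the same length."""
    if rng is None:
        rng = random.Random()
    end = start + 3 * n_codons
    if start < 0 or end > len(sequence):
        raise ValueError("cassette does not fit within the sequence")
    return sequence[:start] + randomized_cassette(n_codons, rng) + sequence[end:]


if __name__ == "__main__":
    # toy coding sequence; the cassette covers codons 3 and 4
    native = "ATG" + "GCT" * 5 + "TAA"
    print(cassette_mutagenesis(native, 6, 2, rng=random.Random(1)))
```

Only the replaced region changes; the flanking native sequence is carried over unaltered,
as with a physical cassette ligated between two restriction sites.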

In vivo (or ex vivo) mutagenesis can be used in a process of generating
random mutations in any cloned DNA of interest which involves the propagation of the
DNA, e.g., in a strain of E. coli that carries mutations in one or more of the DNA repair
pathways. These "mutator" strains have a higher random mutation rate than that of a
wild-type parent. Propagating the DNA in one of these strains will eventually generate
random mutations within the DNA.

Exponential ensemble mutagenesis can be used for generating
combinatorial libraries with a high percentage of unique and functional mutants, where
small groups of residues are randomized in parallel to identify, at each altered position,
amino acids which lead to functional proteins. Examples of such procedures are found in
Delegrave & Youvan (1993) Biotechnology Research 11:1548-1552. Similarly, random
and site-directed mutagenesis can be used. Examples of such procedures are found in
Arnold (1993) Current Opinion in Biotechnology 4:450-455.

Kits for mutagenesis, library construction, and other diversity generation
methods are also commercially available. For example, kits are available from
Stratagene (e.g., QuickChange site-directed mutagenesis kit; and Chameleon
double-stranded, site-directed mutagenesis kit), Bio/Can Scientific, Bio-Rad (e.g., using the
Kunkel method described above), Boehringer Mannheim Corp., Clontech Laboratories,
DNA Technologies, Epicentre Technologies (e.g., 5 prime 3 prime kit), Genpak Inc,
Lemargo Inc, Life Technologies (Gibco BRL), New England Biolabs, Pharmacia Biotech,
Promega Corp., Quantum Biotechnologies, Amersham International plc (e.g., using the
Eckstein method above), and Anglian Biotechnology Ltd (e.g., using the Carter/Winter
method above).

Any of the described shuffling or mutagenesis techniques can be used in
conjunction with procedures which introduce additional diversity into a genome, e.g., a
bacterial, fungal, animal or plant genome. For example, in addition to the methods above,
techniques have been proposed which produce chimeric nucleic acid multimers suitable
for transformation into a variety of species (see, e.g., Schellenberger U.S. Patent No.
5,756,316 and the references above). When such chimeric multimers, consisting of genes
that are divergent with respect to one another (e.g., derived from natural diversity or through
application of site directed mutagenesis, error prone PCR, passage through mutagenic
bacterial strains, and the like), are transformed into a suitable host, this provides a source
of nucleic acid diversity for DNA diversification.

Chimeric multimers transformed into host species are suitable as substrates
for in vivo (or ex vivo) shuffling protocols. Alternatively, a multiplicity of polynucleotides
sharing regions of partial sequence similarity or homology can be transformed into a host
species and recombined in vivo (or ex vivo) by the host cell. Subsequent rounds of cell
division can be used to generate libraries, members of which comprise a single,
homogenous population of monomeric or pooled nucleic acid. Alternatively, the
monomeric nucleic acid can be recovered by standard techniques and recursively
recombined in any of the described shuffling formats.

Chain termination methods of diversity generation have also been proposed
(see, e.g., U.S. Patent No. 5,965,408 and the references above). In this approach, double
stranded DNAs corresponding to one or more genes sharing regions of sequence similarity
or homology are combined and denatured, in the presence or absence of primers specific for
the gene. The single stranded polynucleotides are then annealed and incubated in the
presence of a polymerase and a chain terminating reagent (e.g., ultraviolet, gamma or
X-ray irradiation; ethidium bromide or other intercalators; DNA binding proteins, such as
single strand binding proteins, transcription activating factors, or histones; polycyclic
aromatic hydrocarbons; trivalent chromium or a trivalent chromium salt; or abbreviated
polymerization mediated by rapid thermocycling; and the like), resulting in the production
of partial duplex molecules. The partial duplex molecules, e.g., containing partially
extended chains, are then denatured and reannealed in subsequent rounds of replication or
partial replication, resulting in polynucleotides which share varying degrees of sequence
similarity or homology and which are chimeric with respect to the starting population of
DNA molecules. Optionally, the products or partial pools of the products can be amplified
at one or more stages in the process. Polynucleotides produced by a chain termination
method, such as described above, are suitable substrates for diversity generation methods
(e.g., RSR, DNA shuffling) according to any of the described formats.

Diversity can be further increased by combining methods which are not
homology based with DNA shuffling (which, as set forth in the above publications and
applications, can be homology or non-homology based, depending on the precise format).
For example, incremental truncation for the creation of hybrid enzymes (ITCHY)
described in Ostermeier et al. (1999) "A combinatorial approach to hybrid enzymes
independent of DNA homology" Nature Biotech. 17:1205, can be used to generate an
initial recombinant library which serves as a substrate for one or more rounds of in vitro,
ex vivo, or in vivo diversity generation methods (e.g., RSR or shuffling methods).

Methods for generating multispecies expression libraries have been
described (e.g., U.S. Patent Nos. 5,783,431; 5,824,485 and the references above) and their
use to identify protein activities of interest has been proposed (U.S. Patent 5,958,672 and
the references above). Multispecies expression libraries are, in general, libraries
comprising cDNA or genomic sequences from a plurality of species or strains, operably
linked to appropriate regulatory sequences, in an expression cassette. The cDNA and/or
genomic sequences are optionally randomly concatenated to further enhance diversity.
The vector can be a shuttle vector suitable for transformation and expression in more than
one species of host organism, e.g., bacterial species, eukaryotic cells. In some cases, the
library is biased by preselecting sequences which encode a protein of interest, or which
hybridize to a nucleic acid of interest. Any such libraries can be provided as substrates for
any of the methods herein described.

In some applications, it is desirable to preselect or prescreen libraries (e.g.,
an amplified library, a genomic library, a cDNA library, a normalized library, etc.) or
other substrate nucleic acids prior to shuffling, or to otherwise bias the substrates towards
nucleic acids that encode functional products (shuffling procedures can also
independently have these effects). For example, in the case of antibody engineering, it is
possible to bias the shuffling process toward antibodies with functional antigen binding
sites by taking advantage of in vivo (or ex vivo or in vitro) recombination events prior to
diversity generation (e.g., DNA shuffling) by any described method. For example,
recombined CDRs derived from B cell cDNA libraries can be amplified and assembled
into framework regions (e.g., Jirholt et al. (1998) "Exploiting sequence space: shuffling in
vivo formed complementarity determining regions into a master framework," Gene
215:471) prior to diversity generation (e.g., DNA shuffling) according to any of the
methods described herein.

Libraries can be biased towards nucleic acids which encode proteins with
desirable activities (e.g., binding affinities, enzymatic activities, anti-viral activities,
ability to induce an immune response, antiproliferative activities, adjuvant properties,
etc.). For example, after identifying a clone from a library which exhibits a specified
activity, the clone can be mutagenized using any known method for introducing DNA
alterations, including, but not restricted to, DNA shuffling or another form of recursive
sequence recombination or diversity generation. A library comprising the mutagenized
homologues is then screened for a desired activity, which can be the same as or different
from the initially specified activity. An example of such a procedure is proposed in U.S.
Patent No. 5,939,250. Desired activities can be identified by any method known in the art.
For example, WO 99/10539 proposes that gene libraries can be screened by combining
extracts from the gene library with components obtained from metabolically rich cells and
identifying combinations which exhibit the desired activity. It has also been proposed
(e.g., WO 98/58085) that clones with desired activities can be identified by inserting
bioactive substrates into samples of the library, and detecting bioactive fluorescence
corresponding to the product of a desired activity using a fluorescent analyzer, e.g., a flow
cytometry device, a CCD, a fluorometer, or a spectrophotometer.

Libraries can also be biased towards nucleic acids which have specified
characteristics, e.g., hybridization to a selected nucleic acid probe. For example,
application WO 99/10539 proposes that polynucleotides encoding a desired activity (e.g.,
an enzymatic activity, for example: a lipase, an esterase, a protease, a glycosidase, a
glycosyl transferase, a phosphatase, a kinase, an oxygenase, a peroxidase, a hydrolase, a
hydratase, a nitrilase, a transaminase, an amidase or an acylase) can be identified from
among genomic DNA sequences in the following manner. Single stranded DNA
molecules from a population of genomic DNA are hybridized to a ligand-conjugated
probe. The genomic DNA can be derived from either a cultivated or uncultivated
microorganism, or from an environmental sample. Alternatively, the genomic DNA can be
derived from a multicellular organism, or a tissue derived therefrom.

Second strand synthesis can be conducted directly from the hybridization
probe used in the capture, with or without prior release from the capture medium, or by a
wide variety of other strategies known in the art. Alternatively, the isolated
single-stranded genomic DNA population can be fragmented without further cloning and used
directly in a shuffling-based gene reassembly process. In one such method the fragment
population derived from the genomic library(ies) is annealed with partial or, often,
approximately full length ssDNA or RNA corresponding to the opposite strand. Assembly
of complex chimeric genes from this population is then mediated by nuclease-based removal
of non-hybridizing fragment ends, polymerization to fill gaps between such fragments, and
subsequent single stranded ligation. The parental strand can be removed by digestion (if
RNA or uracil-containing), magnetic separation under denaturing conditions (if labeled in
a manner conducive to such separation) and other available separation/purification
methods. Alternatively, the parental strand is optionally co-purified with the chimeric
strands and removed during subsequent screening and processing steps. As set forth in
"Single-stranded nucleic acid template-mediated recombination and nucleic acid fragment
isolation" by Affholter (USSN 60/186,482, filed March 2, 2000) and WO 98/27230,
"Methods and Compositions for Polypeptide Engineering" by Patten and Stemmer,
shuffling using single-stranded templates and nucleic acids of interest which bind to a
portion of the template can also be performed.

In one approach, single-stranded molecules are converted to
double-stranded DNA (dsDNA) and the dsDNA molecules are bound to a solid support by
ligand-mediated binding. After separation of unbound DNA, the selected DNA molecules are
released from the support and introduced into a suitable host cell to generate a library
enriched in sequences which hybridize to the probe. A library produced in this manner
provides a desirable substrate for any of the shuffling reactions described herein.

"Non-Stochastic" methods of generating nucleic acids and polypeptides are 
alleged in Short, J. "Non-Stochastic Generation of Genetic Vaccines and Enzymes," WO 
00/46344. These methods, including the proposed non-stochastic polynucleotide
reassembly and gene site saturation mutagenesis and synthetic ligation polynucleotide 
reassembly methods outlined therein, can be applied to the present invention as well. 

It will readily be appreciated that any of the above described techniques 
suitable for enriching a library prior to diversification can also be used to screen the
products, or libraries of products, produced by the diversity generating methods. 

A recombinant nucleic acid produced by recursively recombining one or
more polynucleotides of the invention with one or more additional nucleic acids also
forms a part of the invention. The one or more additional nucleic acids may include
another polynucleotide of the invention; optionally, alternatively, or in addition, the one or
more additional nucleic acids can include, e.g., a nucleic acid encoding a
naturally-occurring interferon-alpha or a subsequence thereof, or any homologous interferon-alpha
sequence or subsequence thereof, or an interferon-beta sequence or subsequence thereof
(e.g., an interferon-alpha or interferon-beta sequence as found in GenBank or other
available literature), or, e.g., any other homologous or non-homologous nucleic acid
(certain recombination formats noted above, notably those performed synthetically or in
silico, do not require homology for recombination).

The recombining steps may be performed in vivo, ex vivo, in vitro, or in
silico as described in more detail in the references above. Also included in the invention
is a cell containing any resulting recombinant nucleic acid, nucleic acid libraries produced
by diversity generation, recombination, or recursive recombination of the nucleic acids set
forth herein, and populations of cells, vectors, viruses, plasmids or the like comprising the
library or comprising any recombinant nucleic acid resulting from diversity generation or
recombination (or recursive recombination) of a nucleic acid as set forth herein with
another such nucleic acid, or an additional nucleic acid. Corresponding sequence strings
in a database present in a computer system or computer readable medium are a feature of
the invention.

OTHER POLYNUCLEOTIDE COMPOSITIONS 

The invention also includes compositions comprising two or more
polynucleotides of the invention (e.g., as substrates for recombination). The composition
can comprise a library of recombinant nucleic acids, where the library contains at least 2,
3, 5, 10, 20, or 50 or more nucleic acids. The nucleic acids are optionally cloned into
expression vectors, providing expression libraries.

The invention also includes compositions produced by digesting one or
more polynucleotides of the invention with a restriction endonuclease, an RNAse, or a
DNAse (e.g., as is performed in certain of the recombination formats noted above); and
compositions produced by fragmenting or shearing one or more polynucleotides of the
invention by mechanical means (e.g., sonication, vortexing, and the like), which can also
be used to provide substrates for recombination in the methods above. Similarly,
compositions comprising sets of oligonucleotides corresponding to more than one nucleic
acid of the invention are useful as recombination substrates and are a feature of the
invention. For convenience, these fragmented, sheared, or oligonucleotide synthesized
mixtures are referred to as fragmented nucleic acid sets.

Also included in the invention are compositions produced by incubating
one or more of the fragmented nucleic acid sets in the presence of ribonucleotide- or
deoxyribonucleotide triphosphates and a nucleic acid polymerase. This resulting
composition forms a recombination mixture for many of the recombination formats noted
above. The nucleic acid polymerase may be an RNA polymerase, a DNA polymerase, or
an RNA-directed DNA polymerase (e.g., a "reverse transcriptase"); the polymerase can
be, e.g., a thermostable DNA polymerase (such as VENT, TAQ, or the like).

INTERFERON HOMOLOGUE POLYPEPTIDES

The invention provides isolated or recombinant interferon-alpha homologue
polypeptides, also referred to herein as "interferon-alpha homologues," or "interferon
homologues" or "IFN-alpha homologues" or "IFN homologues". An isolated or
recombinant interferon homologue polypeptide of the invention includes a polypeptide
comprising a sequence selected from SEQ ID NO:36 to SEQ ID NO:70 and SEQ ID
NO:79 to SEQ ID NO:85, and conservatively modified variations thereof, and fragments
thereof having an antiproliferative activity in, e.g., a human Daudi cell line-based assay (or
other similar assay) and/or an antiviral activity in, e.g., a murine cell line/EMCV-based
assay (or other similar assay). An alignment of exemplary interferon homologue
polypeptide sequences according to the invention is provided in Fig. 1. Alignment of the
polypeptide sequences of the invention to each other or to sequences of known,
naturally-occurring interferon-alphas is readily performed by one of ordinary skill in the art using
publicly available databases and alignment programs.

The invention also provides a polypeptide comprising at least about 100,
120, 130, 140, 150, 155, 160, 163, 165, or 166 contiguous amino acids of any one of SEQ
ID NOS:36-70 or SEQ ID NO:71. In one aspect, said amino acid sequence comprises
amino acids Lys160 and Glu166, wherein the numbering of the amino acids in the
sequence corresponds to that of SEQ ID NO:36.

Several conclusions may be drawn from comparison of the exemplary
sequences of the invention (Fig. 1) to sequences of known, naturally-occurring
interferon-alphas and other Type I interferons (including beta, delta, omega, and tau-interferons)
from human and non-human sources. Such sequences are readily available from a variety
of sources, such as GenBank, and the Pfam (Protein Families) database at
http://www.sanger.ac.uk/Software/Pfam/index.shtml.

Of particular note is the presence, in some interferon homologue
polypeptide sequences of the invention, of the following amino acid residues (denoted
"Group I" residues) which do not appear in the equivalent position of known,
naturally-occurring human or non-human Type I interferon sequences.

Group I: Asp11; Pro14; Arg50; Phe55; Asp75; Asn80; Pro111; Leu124;
Glu134; Ser140; and Ala143; with residue numbering corresponding to the mature
interferon homologue sequence identified as SEQ ID NO:36.

Also of note is the presence, in some interferon homologue polypeptide 
sequences of the invention, of the following amino acid residues (denoted "Group II" 
residues) which do not appear in the equivalent position of known, naturally-occurring 
human interferon-alpha subtype sequences. 
Group II: Pro9; (Lys, Ser)12; (Thr, Val)24; Gln34; Arg40; Ser45; Arg47;
Leu56; Ile60; Phe67; Ala79; Gly88; His90; Arg91; Glu95; Val101; (Gly, Ala)104;
Val112; Gly114; Pro116; Lys133; and His136.
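
The Group I and Group II lists can be read as position-to-residue lookup tables. The following is a minimal sketch, assuming a candidate sequence that is already in the numbering of SEQ ID NO:36 (ungapped, 1-based positions); the names `GROUP_I`, `GROUP_II`, and `residues_present` are illustrative and do not come from the specification.

```python
# Group I and Group II residues from the text, keyed by 1-based position
# (SEQ ID NO:36 numbering) and given as one-letter amino acid codes.
GROUP_I = {11: "D", 14: "P", 50: "R", 55: "F", 75: "D", 80: "N",
           111: "P", 124: "L", 134: "E", 140: "S", 143: "A"}

GROUP_II = {9: "P", 12: "KS", 24: "TV", 34: "Q", 40: "R", 45: "S", 47: "R",
            56: "L", 60: "I", 67: "F", 79: "A", 88: "G", 90: "H", 91: "R",
            95: "E", 101: "V", 104: "GA", 112: "V", 114: "G", 116: "P",
            133: "K", 136: "H"}


def residues_present(seq, group):
    """Return the sorted positions at which seq carries a residue of the group.

    Assumes seq is ungapped and numbered like SEQ ID NO:36 (hypothetical input).
    """
    return [pos for pos, allowed in sorted(group.items())
            if pos <= len(seq) and seq[pos - 1] in allowed]
```

A sequence reporting any position from `GROUP_I` would carry at least one Group I residue in the sense used above; whether it also has the stated activities is a separate, experimental question.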

In other embodiments, the interferon homologue polypeptide comprises at
least 20, 50, 100, 150, 155, or 160 or more contiguous amino acids of any one of SEQ ID
NOS:36-70 and/or one or more of amino acids Ala19, (Tyr or Gln)34, Gly37, Phe38,
Lys71, Ala76, Tyr90, Ile132, Arg134, Phe152, Lys160, and Glu166, wherein the
numbering of the amino acids corresponds to that of SEQ ID NO:36, or one or more of
amino acids Pro9, (Lys or Ser)12, (Thr or Val)24, Gln34, Arg40, Ser45, Arg47, Leu56,
Ile60, Phe67, Ala79, Gly88, His90, Arg91, Glu95, Val101, (Gly or Ala)104, Val112,
Gly114, Pro116, Lys133, and His136, wherein the numbering of the amino acids in said
polypeptide sequence corresponds to the numbering of individual amino acids in the
amino acid sequence of SEQ ID NO:36. Thus, for example, in this embodiment, an
interferon polypeptide comprises an amino acid sequence comprising a proline residue at
amino acid position 9 in the sequence, a lysine or serine residue at position 12, a threonine
or valine residue at position 24, a glutamine residue at position 34, an arginine residue at
position 40, etc. Such polypeptides may exhibit antiproliferative activities in a human
Daudi cell line-based proliferation assay (e.g., at least about 8.3 x 10^6 units/mg) and/or
antiviral activities in a human WISH cell/EMCV-based assay (at least about 2.1 x 10^7
units/mg). Some such polypeptides bind a human alpha interferon receptor. Some such
polypeptides are 166 amino acids in length. In another aspect, such polypeptides may
comprise a sequence selected from any of the group of SEQ ID NO:36 to SEQ ID NO:54.

An antiproliferative activity of any polypeptide of the invention generally
relates to the capability or ability of a polypeptide to inhibit cells or parts thereof from
growing or producing new cellular growth rapidly and often repeatedly.

The invention further includes a polypeptide (e.g., any of SEQ ID NOS:36-
71 or SEQ ID NOS:79-85) or a nucleic acid (e.g., any of SEQ ID NOS:1-35 or SEQ ID
NOS:72-78) encoding a polypeptide, wherein said polypeptide has an anti-angiogenic
activity as measured by an anti-angiogenesis assay well known to those of ordinary skill in
the art.

The invention further includes: 

(a) any interferon-alpha polypeptide comprising one or more Group I
amino acid residues above.

(b) any interferon-alpha polypeptide comprising one or more Group II
amino acid residues above in the context of a human-like interferon sequence (i.e., a
sequence which displays a high level of similarity or homology to a human interferon), or
a sequence which is highly similar or homologous (i.e., having a percent sequence
homology or sequence identity of at least about 80%, 90%, 95%, 96%, 97%, 98% or
more) to any sequence listed in the attached sequence listing or fragment thereof.






(c) any interferon-alpha polypeptide containing a combination of the
following residues, which are localized in or near the regions of the interferon-alpha
molecule known or proposed to interact with a Type I interferon receptor, where such
sequence combinations (motifs) do not appear in the equivalent position of any known
naturally-occurring human or non-human Type I interferon:

(i) (Tyr or Gln)34; plus one or more of Ile132 or Arg134; or

(ii) Asp78, Glu79, or (Asp or Thr)80; plus one or more of Ile132 or
Arg134.

In another embodiment, the present invention provides an interferon alpha
homologue comprising the sequence shown in SEQ ID NO:71:
CDLPQTHSLG-X11-X12-RA-X15-X16-LL-X19-QM-X22-R-X24-S-X26-FSCLKDR-X34-DFG-X38-P-X40-EEFD-X45-
X46-X47-FQ-X50-X51-QAI-X55-X56-X57-[...]-
W-X78-X79-X80-IX-X83-K-X85-[...]-
X106-X107-X108-TPL-[...]-
KYSPC-X140-WEVVRAEIMRSFSFSTNLQKRLRRKE, or a conservatively substituted
variation thereof, where X11 is N or D; X12 is R, S, or K; X15 is L or M; X16 is I, M, or V;
X19 is A or G; X22 is G or R; X24 is I or T; X26 is P or H; X34 is H, Y, or Q; X38 is F or L;
X40 is Q or R; X45 is G or S; X46 is N or H; X47 is Q or R; X50 is K or R; X51 is A or T; X55
is S or F; X56 is V or A; X57 is L or F; X60 is M or I; X61 is I or M; X67 is L or F; X72 is D
or N; X75 is A or V; X76 is A or T; X78 is E or D; X79 is Q or E; X80 is S, R, T, or N; X83 is
E or D; X85 is F or L; X86 is S or Y; X88 is E or G; X90 is Y, H, or N; X95 is D, E, or N; X101 is
I, M, or V; X103 is E or G; X105 is G or W; X106 is V or M; X107 is E, G, or K; X108 is E or
G; X114 is V, E, or G; X116 is S or P; X121 is K or R; X124 is F or L; X132 is T, I, or M; X134
is K or R; and X140 is A or S; or a fragment of said SEQ ID NO:71. In another aspect, the
interferon homologue polypeptide of SEQ ID NO:71, or a fragment thereof, exhibits an
antiproliferative activity in a human Daudi cell line-based proliferation assay (at least
about 8.3 x 10^6 units/mg) and/or an antiviral activity in a human WISH cell/EMCV-based
assay (at least about 2.1 x 10^7 units/mg). Both such assays are discussed in greater detail
below. Such polypeptide may comprise an amino acid sequence of the group of from SEQ
ID NO:36 to SEQ ID NO:54 or may be encoded by a nucleotide sequence of the group of
from SEQ ID NO:1 to SEQ ID NO:19.
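
The variable positions of SEQ ID NO:71 lend themselves to a small worked calculation. The sketch below transcribes the X definitions into a table, checks a candidate against them, and counts how many residue combinations the variable positions alone allow. It rests on several assumptions: the candidate is ungapped and numbered as in SEQ ID NO:71, only the variable positions are checked (the fixed residues between them are not), and the names `ALLOWED`, `variable_position_mismatches`, and `variant_count` are illustrative.

```python
from math import prod

# Allowed residues at each variable position, transcribed from the X
# definitions in the text (1-based positions, one-letter amino acid codes).
ALLOWED = {
    11: "ND", 12: "RSK", 15: "LM", 16: "IMV", 19: "AG", 22: "GR", 24: "IT",
    26: "PH", 34: "HYQ", 38: "FL", 40: "QR", 45: "GS", 46: "NH", 47: "QR",
    50: "KR", 51: "AT", 55: "SF", 56: "VA", 57: "LF", 60: "MI", 61: "IM",
    67: "LF", 72: "DN", 75: "AV", 76: "AT", 78: "ED", 79: "QE", 80: "SRTN",
    83: "ED", 85: "FL", 86: "SY", 88: "EG", 90: "YHN", 95: "DEN",
    101: "IMV", 103: "EG", 105: "GW", 106: "VM", 107: "EGK", 108: "EG",
    114: "VEG", 116: "SP", 121: "KR", 124: "FL", 132: "TIM", 134: "KR",
    140: "AS",
}


def variable_position_mismatches(seq):
    """Return (position, residue) pairs where seq falls outside the allowed set."""
    return [(pos, seq[pos - 1]) for pos, allowed in sorted(ALLOWED.items())
            if pos <= len(seq) and seq[pos - 1] not in allowed]


def variant_count():
    """Number of residue combinations over the variable positions alone."""
    return prod(len(allowed) for allowed in ALLOWED.values())
```

With 47 variable positions, most offering two choices, the count comes to 2^39 x 3^9, which gives a sense of how large a sequence family the single formula describes.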
Fragments of the interferon homologue polypeptides described herein are
also a feature of the invention. An interferon alpha homologue fragment of the invention
typically comprises an interferon homologue polypeptide comprising at least about 20, 25,
or 30, and typically at least about 40, 50, 60, 70, 80, 90, or 100 contiguous amino acids of
any one of SEQ ID NOS:36-71 or SEQ ID NOS:79-85. In other embodiments, the
fragment comprises usually at least about 100, 110, 120, 125, 130, 140, 150, 155, 158,
160, 162, 163, 164, or 165 contiguous amino acids of any one of SEQ ID NOS:36-71 or
SEQ ID NOS:79-85. Such polypeptide fragments may have an antiproliferative activity in
a human Daudi cell line-based assay and/or an antiviral activity in a human or murine cell
line/EMCV-based assay.
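
The "at least about N contiguous amino acids" criterion can be checked mechanically. The following is a minimal sketch, assuming plain one-letter sequences as input; `longest_shared_run` is an illustrative name, and the function reports only the length of the longest stretch shared exactly by two sequences, not whether the fragment is active.

```python
def longest_shared_run(candidate, reference):
    """Length of the longest contiguous stretch present in both sequences.

    Dynamic programming over suffix-match lengths; cur[j] holds the length of
    the common run ending at candidate[i-1] and reference[j-1].
    """
    best = 0
    prev = [0] * (len(reference) + 1)
    for i in range(1, len(candidate) + 1):
        cur = [0] * (len(reference) + 1)
        for j in range(1, len(reference) + 1):
            if candidate[i - 1] == reference[j - 1]:
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best
```

A candidate would satisfy, for example, the 100-residue criterion against a given reference when `longest_shared_run(candidate, reference) >= 100`.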

In other embodiments, the invention provides polypeptides having a length
of 166 amino acids, and, in some such embodiments, such polypeptides have an
antiproliferative activity in a human Daudi cell line-based assay (or other similar assay),
including, e.g., at least about 8.3 x 10^6 units/mg, and/or an antiviral activity in a human
WISH cell line/EMCV-based assay (or other similar assay), including, e.g., at least about
2.1 x 10^7 units/mg.

In other embodiments, the invention provides a polypeptide comprising at
least 100, 150, 155, or 160 contiguous amino acids of a protein encoded by a coding
polynucleotide sequence comprising any of the following: (a) SEQ ID NO:1 to SEQ ID
NO:35 or SEQ ID NO:72 to SEQ ID NO:78; (b) a coding polynucleotide sequence that
encodes a first polypeptide selected from any of SEQ ID NO:36 to SEQ ID NO:70 or SEQ
ID NO:79 to SEQ ID NO:85; and (c) a complementary polynucleotide sequence that
hybridizes under at least highly stringent (or ultra-high stringent or ultra-ultra-high
stringent) hybridization conditions over substantially the entire length of a
polynucleotide sequence of (a) or (b). Such polypeptides may have an antiproliferative
activity in a human Daudi cell line-based assay (or other similar assay), and/or an antiviral
activity in a human WISH cell line/EMCV-based assay (or other similar assay). Some
such polypeptides of the invention specifically bind a human alpha interferon receptor.
The polypeptides and nucleic acids of the subject invention need not be identical, but can
be substantially identical, to the corresponding sequence of the target molecule or related
molecule, including the polypeptides of any of SEQ ID NOS:36-71 or fragments thereof
(including those having antiviral or antiproliferative activities in the assays described
herein), or the nucleic acids of any of SEQ ID NOS:1-35 or fragments thereof (including
those having antiviral or antiproliferative activities in the assays described herein). The
polypeptides can be subject to various changes, such as insertions, deletions, and
substitutions, either conservative or non-conservative, where such changes might provide
for certain advantages in their use. The polypeptides of the invention can be modified in a
number of ways so long as they comprise a sequence substantially identical (as defined
below) or having a percent identity to a sequence in the naturally occurring or known
interferon polypeptide molecule.

Alignment and comparison of relatively short amino acid sequences (less
than about 30 residues) is typically straightforward. Comparison of longer sequences can
require more sophisticated methods to achieve optimal alignment of two sequences.
Optimal alignment of sequences for aligning a comparison window can be conducted by
the local homology algorithm of Smith and Waterman (1981) Adv. Appl. Math. 2:482, by
the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol.
48:443, by the search for similarity method of Pearson and Lipman (1988) Proc. Nat'l
Acad. Sci. (USA) 85:2444, by computerized implementations of these algorithms (GAP,
BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package Release
7.0, Genetics Computer Group, 575 Science Dr., Madison, WI), or by inspection, and the
best alignment (i.e., resulting in the highest percentage of sequence similarity over the
comparison window) generated by the various methods is selected.
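
For orientation, the global alignment approach of Needleman and Wunsch can be sketched in a few lines. This is an illustration only, not the GAP or BESTFIT implementation: it assumes a flat scoring scheme (match +1, mismatch -1, linear gap penalty -2) chosen for the example rather than taken from the text, and the function name `needleman_wunsch` is likewise illustrative.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Global alignment of a and b; returns (score, aligned_a, aligned_b).

    Scoring values are illustrative assumptions, with a linear gap penalty.
    """
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return score[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))
```

Production tools use residue substitution matrices and affine gap penalties, so scores from this sketch are not comparable to theirs.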

The term "sequence identity" means that two polynucleotide sequences are
identical (i.e., on a nucleotide-by-nucleotide basis) over a window of comparison. The
term "percentage of sequence identity" or "percent sequence identity" is calculated by
comparing two optimally aligned sequences over the window of comparison, determining
the number of positions at which identical residues occur in both sequences to yield
the number of matched positions, dividing the number of matched positions by the total
number of positions in the window of comparison (i.e., the window size), and multiplying
the result by 100 to yield the percentage of sequence identity. In one aspect, the present
invention provides interferon homologue nucleic acids having at least about 80%, 85%,
90%, 95%, 96%, 97%, 98%, 99%, 99.5% or more percent sequence identity with the
nucleic acids of any of SEQ ID NOS:1-35 or SEQ ID NOS:72-78 or fragments thereof.
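
The percent identity calculation defined above reduces to a short function. The sketch below assumes the two sequences have already been optimally aligned and are supplied as equal-length strings with "-" marking gaps; treating a gap column as a position in the window that never counts as a match is an assumption of this example, and `percent_identity` is an illustrative name.

```python
def percent_identity(aligned_a, aligned_b):
    """Matched positions divided by window size, times 100.

    Inputs are pre-aligned, equal-length strings; '-' marks a gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("comparison window is empty")
    matched = sum(1 for x, y in zip(aligned_a, aligned_b)
                  if x == y and x != "-")
    return 100.0 * matched / len(aligned_a)
```

For instance, the aligned pair "ACGT" and "A-GT" has three matched positions in a window of four, giving 75 percent identity.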
As applied to polypeptides, the term "substantial identity" means that two
peptide sequences, when optimally aligned, such as by the programs GAP or BESTFIT
using default gap weights (described in detail below), share at least about 80 percent
sequence identity, preferably at least about 90 percent sequence identity, more preferably
at least about 95 percent sequence identity or more (e.g., 97, 98, or 99 percent sequence
identity). Preferably, residue positions which are not identical differ by conservative
amino acid substitutions. Conservative amino acid substitutions refer to the
interchangeability of residues having similar side chains. For example, a group of amino
acids having aliphatic side chains is glycine, alanine, valine, leucine, and isoleucine; a
group of amino acids having aliphatic-hydroxyl side chains is serine and threonine; a
group of amino acids having amide-containing side chains is asparagine and glutamine; a
group of amino acids having aromatic side chains is phenylalanine, tyrosine, and
tryptophan; a group of amino acids having basic side chains is lysine, arginine, and
histidine; and a group of amino acids having sulfur-containing side chains is cysteine and
methionine. Preferred conservative amino acid substitution groups are:
valine-leucine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alanine-valine, and
asparagine-glutamine. In one aspect, the present invention provides interferon homologue
polypeptides having at least about 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, 99.5% or
more percent sequence identity with the polypeptides of any of SEQ ID NOS:36-71 or
SEQ ID NOS:79-85 or fragments thereof.
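
The preferred conservative substitution groups can be encoded directly as sets. This minimal sketch assumes one-letter amino acid codes; `PREFERRED_GROUPS` and `is_conservative` are illustrative names, and only the five preferred groups listed above are encoded, not the broader side-chain families.

```python
# Preferred conservative substitution groups from the text, one-letter codes:
# Val-Leu-Ile, Phe-Tyr, Lys-Arg, Ala-Val, Asn-Gln.
PREFERRED_GROUPS = [set("VLI"), set("FY"), set("KR"), set("AV"), set("NQ")]


def is_conservative(x, y):
    """True if x -> y is a substitution within one of the preferred groups."""
    return x != y and any(x in g and y in g for g in PREFERRED_GROUPS)
```

Note that the groups overlap without being transitive: valine pairs with both leucine and alanine, but alanine to leucine is not in any single preferred group.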

A preferred example of an algorithm that is suitable for determining percent
sequence identity and sequence similarity is the FASTA algorithm, which is described in
Pearson, W.R. & Lipman, D.J., 1988, Proc. Nat'l Acad. Sci. USA 85:2444. See also W.
R. Pearson, 1996, Methods Enzymol. 266:227-258. Preferred parameters used in a
FASTA alignment of DNA sequences to calculate percent identity are optimized, BL50
Matrix 15:-5, k-tuple = 2; joining penalty = 40, optimization = 28; gap penalty = -12, gap
length penalty = -2; and width = 16.

Another preferred example of algorithms that are suitable for determining
percent sequence identity and sequence similarity are the BLAST and BLAST 2.0
algorithms, which are described in Altschul et al., 1990, J. Mol. Biol. 215:403-410
and Altschul et al., 1997, Nuc. Acids Res. 25:3389-3402, respectively. BLAST and BLAST
2.0 are used, with the parameters described herein, to determine percent sequence identity
for the nucleic acids and proteins of the invention. Software for performing BLAST
analyses is publicly available through the National Center for Biotechnology Information
(http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring
sequence pairs (HSPs) by identifying short words of length W in the query sequence,
which either match or satisfy some positive-valued threshold score T when aligned with a
word of the same length in a database sequence. T is referred to as the neighborhood word
score threshold (Altschul et al., supra). These initial neighborhood word hits act as seeds
for initiating searches to find longer HSPs containing them. The word hits are extended in
both directions along each sequence for as far as the cumulative alignment score can be
increased. Cumulative scores are calculated using, for nucleotide sequences, the
parameters M (reward score for a pair of matching residues; always > 0) and N (penalty
score for mismatching residues; always < 0). For amino acid sequences, a scoring matrix
is used to calculate the cumulative score. Extension of the word hits in each direction is
halted when: the cumulative alignment score falls off by the quantity X from its maximum
achieved value; the cumulative score goes to zero or below, due to the accumulation of
one or more negative-scoring residue alignments; or the end of either sequence is reached.
The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the
alignment. The BLASTN program (for nucleotide sequences) uses as defaults a
wordlength (W) of 11, an expectation (E) of 10, M=5, N=-4 and a comparison of both
strands. For amino acid sequences, the BLASTP program uses as defaults a wordlength of
3, an expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff &
Henikoff (1992) Proc. Nat'l Acad. Sci. U.S.A. 89:10915), with alignments (B) of 50,
expectation (E) of 10, M=5, N=-4, and a comparison of both strands.
expectation (E) of 10, M=5, N=-4, and a comparison of both strands. 
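
The word-hit extension described above can be illustrated with a minimal sketch. It is not the NCBI implementation: it extends a seed in one direction only, without gaps, and scores nucleotides with the M=5 and N=-4 values quoted above. The function name and the value chosen for X (x_drop=20) are assumptions made for illustration.

```python
def extend_right(query, subject, q_start, s_start, word_len,
                 match=5, mismatch=-4, x_drop=20):
    """Extend an exact seed word hit to the right, without gaps.

    Returns (best_score, best_length) for the highest-scoring extension.
    x_drop plays the role of X; its default here is a hypothetical value.
    """
    # The seed is assumed to be an exact match of word_len residues.
    score = match * word_len
    best_score, best_len = score, word_len
    length = word_len
    i, j = q_start + word_len, s_start + word_len
    # Halt at the end of either sequence.
    while i < len(query) and j < len(subject):
        score += match if query[i] == subject[j] else mismatch
        length += 1
        if score > best_score:
            best_score, best_len = score, length
        # Halt when the score has fallen X below its maximum,
        # or when it has dropped to zero or below.
        if best_score - score >= x_drop or score <= 0:
            break
        i += 1
        j += 1
    return best_score, best_len


# Hypothetical sequences sharing their first 12 nucleotides.
query = "ACGTACGTACGTTTTT"
subject = "ACGTACGTACGTAAAA"
result = extend_right(query, subject, 0, 0, word_len=11)
```

A full search would also extend to the left and repeat this for every seed; the sketch shows only the halting rules.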

The BLAST algorithm also performs a statistical analysis of the similarity
between two sequences (see, e.g., Karlin & Altschul (1993) Proc. Nat'l Acad. Sci. U.S.A.
90: 5873-5877). One measure of similarity provided by the BLAST algorithm is the
smallest sum probability (P(N)), which provides an indication of the probability by which
a match between two nucleotide or amino acid sequences would occur by chance. For
example, a nucleic acid is considered similar to a reference sequence if the smallest sum
probability in a comparison of the test nucleic acid to the reference nucleic acid is less
than about 0.2, more preferably less than about 0.01, and most preferably less than about
0.001.

Another example of a useful algorithm is PILEUP. PILEUP creates a
multiple sequence alignment from a group of related sequences using progressive, 
pairwise alignments to show relationship and percent sequence identity. It also plots a tree 
or dendrogram showing the clustering relationships used to create the alignment. PILEUP
uses a simplification of the progressive alignment method of Feng & Doolittle (1987) J.
Mol. Evol. 25: 351-360. The method used is similar to the method described by Higgins &
Sharp (1989) CABIOS 5: 151-153. The program can align up to 300 sequences, each of a 
maximum length of 5,000 nucleotides or amino acids. The multiple alignment procedure 
begins with the pairwise alignment of the two most similar sequences, producing a cluster 

of two aligned sequences. This cluster is then aligned to the next most related sequence or
cluster of aligned sequences. Two clusters of sequences are aligned by a simple extension 
of the pairwise alignment of two individual sequences. The final alignment is achieved by 
a series of progressive, pairwise alignments. The program is run by designating specific 
sequences and their amino acid or nucleotide coordinates for regions of sequence 

comparison and by designating the program parameters. Using PILEUP, a reference

sequence is compared to other test sequences to determine the percent sequence identity 
relationship using the following parameters: default gap weight (3.00), default gap length 
weight (0.10), and weighted end gaps. PILEUP can be obtained from the GCG sequence 
analysis software package, e.g., version 7.0 (Devereaux et al. (1984) Nuc. Acids Res. 12:
387-395).

Another preferred example of an algorithm that is suitable for multiple 
DNA and amino acid sequence alignments is the CLUSTALW program (Thompson, J. D. 
et al. (1994) Nucl. Acids. Res. 22: 4673-4680). ClustalW performs multiple pairwise
comparisons between groups of sequences and assembles them into a multiple alignment
based on homology. Gap open and Gap extension penalties were 10 and 0.05,
respectively. For amino acid alignments, the BLOSUM algorithm can be used as a protein
weight matrix (Henikoff and Henikoff (1992) Proc. Nat'l Acad. Sci. U.S.A. 89: 10915-10919).

Making Polypeptides of the Invention 
Recombinant methods for producing and isolating interferon homologue
polypeptides of the invention are described above. In addition to recombinant production,
the polypeptides may be produced by direct peptide synthesis using solid-phase techniques
(cf. Stewart et al. (1969) Solid-Phase Peptide Synthesis, W.H. Freeman Co, San
Francisco; Merrifield, J. (1963) J. Am. Chem. Soc. 85:2149-2154). Peptide synthesis may
be performed using manual techniques or by automation. Automated synthesis may be
achieved, for example, using Applied Biosystems 431A Peptide Synthesizer (Perkin
Elmer, Foster City, Calif.) in accordance with the instructions provided by the
manufacturer. For example, subsequences may be chemically synthesized separately and
combined using chemical methods to provide full-length interferon homologues.
Fragments of the interferon homologue polypeptides of the invention, as discussed in
greater detail above, are also a feature of the invention and may be synthesized by using
the procedures described above.

Polypeptides of the invention can be produced by introducing into a 
population of cells a nucleic acid of the invention, wherein the nucleic acid is operatively 
linked to a regulatory sequence effective to produce the encoded polypeptide, culturing the 
cells in a culture medium to produce the polypeptide, and optionally isolating the 

polypeptide from the cells or from the culture medium.

In another aspect, polypeptides of the invention can be produced by 
introducing into a population of cells a recombinant expression vector comprising at least 
one nucleic acid of the invention, wherein the at least one nucleic acid is operatively 
linked to a regulatory sequence effective to produce the encoded polypeptide, culturing the 

cells in a culture medium under suitable conditions to produce the polypeptide encoded by
the expression vector, and optionally isolating the polypeptide from the cells or from the 
culture medium. 

Using Polypeptides 
Antibodies 

In another aspect of the invention, an interferon homologue polypeptide of

the invention is used to produce antibodies which have, e.g., diagnostic, prophylactic and 
therapeutic uses, e.g., related to the activity, distribution, and expression of interferon 
homologues. 

Antibodies to interferon homologues of the invention may be generated by 
methods well known in the art. Such antibodies may include, but are not limited to,

polyclonal, monoclonal, chimeric, humanized, single chain, Fab fragments and fragments
produced by an Fab expression library. Antibodies, i.e., those which block receptor
binding, are especially preferred for therapeutic or prophylactic use. 

Interferon homologue polypeptides for antibody induction do not require
biological activity; however, the polypeptide or oligopeptide must be antigenic. Peptides
used to induce specific antibodies may have an amino acid sequence consisting of at least
10 amino acids, preferably at least 15 or 20 amino acids. Short stretches of an interferon
homologue polypeptide may be fused with another protein, such as keyhole limpet
hemocyanin, and antibody produced against the chimeric molecule. 

Methods of producing polyclonal and monoclonal antibodies are known to 

those of skill in the art, and many antibodies are available. See, e.g., Coligan (1991)
Current Protocols in Immunology Wiley/Greene, NY; and Harlow and Lane (1989)
Antibodies: A Laboratory Manual, Cold Spring Harbor Press, NY; Stites et al. (eds.) Basic
and Clinical Immunology (4th ed.) Lange Medical Publications, Los Altos, CA, and
references cited therein; Goding (1986) Monoclonal Antibodies: Principles and Practice
(2d ed.) Academic Press, New York, NY; and Kohler and Milstein (1975) Nature
256:495-497. Other suitable techniques for antibody preparation include selection of
libraries of recombinant antibodies in phage or similar vectors. See, Huse et al. (1989)
Science 246:1275-1281; and Ward et al. (1989) Nature 341:544-546. Specific monoclonal
and polyclonal antibodies and antisera will usually bind with a Kd of at least about 0.1
µM, preferably at least about 0.01 µM or better, and most typically and preferably, 0.001
µM or better.

Detailed methods for preparation of chimeric (humanized) antibodies can 
be found in U.S. Patent 5,482,856. Additional details on humanization and other antibody 
production and engineering techniques can be found in Borrebaeck (ed.) (1995) Antibody 

Engineering, 2nd Edition Freeman and Company, NY (Borrebaeck); McCafferty et al.
(1996) Antibody Engineering, A Practical Approach, IRL at Oxford Press, Oxford,
England (McCafferty), and Paul (1995) Antibody Engineering Protocols, Humana Press,
Totowa, NJ (Paul).

In one useful embodiment, this invention provides for fully humanized 

antibodies against the interferon homologues of the invention. Humanized antibodies are
especially desirable in applications where the antibodies are used as prophylactics and
therapeutics in vivo and ex vivo in human patients. Human antibodies consist of
characteristically human immunoglobulin sequences. The human antibodies of this
invention can be produced using a wide variety of methods (see, e.g., Larrick et al.,
U.S. Pat. No. 5,001,065, and Borrebaeck, McCafferty and Paul, supra, for a review). In
one embodiment, the human antibodies of the present invention are produced initially in
trioma cells. Genes encoding the antibodies are then cloned and expressed in other cells,
such as nonhuman mammalian cells. The general approach for producing human
antibodies by trioma technology is described by Ostberg et al. (1983), Hybridoma 2:361-367,
Ostberg, U.S. Pat. No. 4,634,664, and Engelman et al., U.S. Pat. No. 4,634,666. The
antibody-producing cell lines obtained by this method are called triomas because they are
descended from three cells: two human and one mouse. Triomas have been found to
produce antibody more stably than ordinary hybridomas made from human cells.

Adjuvants 

In one aspect, the interferon homologue polypeptides of the present 
invention or fragments thereof are useful as adjuvants to stimulate, enhance, potentiate, or 
augment an immune response related to an antigen when administered together with the
antigen or after or before delivery of the antigen. In another aspect, the invention provides
methods for administering one or more of the polypeptides of the invention described herein to a
subject. 

Therapeutic and Prophylactic Agents 
As described in greater detail below, the interferon homologue

polypeptides of the present invention or fragments thereof are useful in the prophylactic 

and/or therapeutic treatment of a variety of diseases, disorders, or medical conditions. 

For example, the invention provides interferon-alpha homologue 

polypeptides (and interferon-alpha homologue nucleic acids which encode such 

polypeptides) that have both antiviral and antiproliferative activities in the assays
described herein. In one aspect, the invention provides interferon-alpha homologue 
polypeptides (and interferon-alpha homologue nucleic acids which encode such 
polypeptides) in which the ratio of antiviral activity to antiproliferative activity is greater 
than that of other known interferon-alphas such as those listed in GenBank as noted 

herein. Such polypeptides (and nucleic acids encoding them) are useful in the therapeutic
and/or prophylactic treatment of various diseases and disorders, such as, e.g., treatment
regimens for hepatitis B, hepatitis C, HIV, and HSV. In such treatment regimens, some
such polypeptides (and nucleic acids encoding them), such as interferon-alpha homologue
2BA8, offer significant advantages over known interferon-alpha compounds, since they
likely exhibit fewer side effects upon administration than known interferon-alpha
compounds, such as interferon-alpha 2a, are of higher potency, and thus may require
lower dosing and cause fewer immunogenicity effects.

SEQUENCE VARIATIONS 

Conservatively Modified Variations 

Interferon homologue polypeptides of the present invention include one or 
more conservatively modified variations (or "conservative variations" or "conservative
substitutions") of the polypeptide sequences disclosed herein as SEQ ID NO:36 to SEQ ID
NO:70 and SEQ ID NO:79 to SEQ ID NO:85. Such conservatively modified variations
comprise substitutions, additions or deletions which alter, add or delete a single amino
acid or a small percentage of amino acids (typically less than about 5%, more typically
less than about 4%, 2%, or 1%) in any of SEQ ID NO:36 to SEQ ID NO:70 and SEQ ID
NO:79 to SEQ ID NO:85.

For example, a conservatively modified variation (e.g., deletion) of the 166 
amino acid polypeptide identified herein as SEQ ID NO:36 has a length of at least about 
157 or 158 amino acids, preferably at least about 159 or 160 amino acids, more preferably 
at least about 162 or 163 amino acids, and still more preferably at least about 164 or 165 

amino acids, corresponding to a deletion of less than about 5%, 4%, 2% or 1% of the
polypeptide sequence, respectively. 
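
The lengths quoted above follow from simple arithmetic on the 166 amino acid polypeptide. The sketch below is illustrative only: the function name is hypothetical, and it reads "less than X%" as a strict inequality, whereas the text says "about", so the quoted ranges (e.g., 157 or 158) are slightly looser than the computed values.

```python
def min_length_after_deletion(full_length, max_percent):
    """Shortest length when strictly less than max_percent of residues are deleted."""
    # Largest whole number of deletions d satisfying d * 100 < max_percent * full_length.
    max_deleted = (max_percent * full_length - 1) // 100
    return full_length - max_deleted


# Minimum lengths for the 5%, 4%, 2% and 1% thresholds on a 166-residue polypeptide.
lengths = {pct: min_length_after_deletion(166, pct) for pct in (5, 4, 2, 1)}
```

Under this reading the minimum lengths are 158, 160, 163 and 165 residues, which are the upper values of each pair given in the text.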

Another example of a conservatively modified variation (e.g., a 
"conservatively substituted variation") of the polypeptide identified herein as SEQ ID 
NO:36 will contain "conservative substitutions", according to the six substitution groups 

set forth in Table 2 (supra), in up to about 8 residues (i.e., less than about 5%) of the 166

amino acid polypeptide. 

The interferon homologue polypeptide sequences of the invention, 
including conservatively substituted sequences, can be present as part of larger 
polypeptide sequences such as those which occur upon the addition of one or more domains for
purification of the protein (e.g., poly His segments, FLAG epitope segments, etc.), e.g.,
where the additional functional domains have little or no effect on the activity of the
interferon-alpha portion of the protein, or where the additional domains can be removed
by post synthesis processing steps such as by treatment with a protease.

In another embodiment, interferon homologue polypeptides of the present 
invention comprise the following sequence, identified herein as SEQ ID NO:71:
CDLPQTHSLG-X11-X^-RA-X^-X^
DFG-X38-P-X40-EEFD-X45-X46-X^
X67-FSTK-X72-SS-X75-X76-W-X78-X79^
LEACV-X101-Q-X103-V-X105-X,(^
QRITLYL-X132-E-X134-KYSPC-X,4^ or a
conservatively substituted variation thereof, where X11 is N or D; X12 is R, S, or K; X15 is
L or M; X16 is I, M, or V; X19 is A or G; X22 is G or R; X24 is I or T; X26 is P or H; X34 is
H, Y or Q; X38 is F or L; X40 is Q or R; X45 is G or S; X46 is N or H; X47 is Q or R; X50 is
K or R; X51 is A or T; X55 is S or F; X56 is V or A; X57 is L or F; X60 is M or I; X61 is I or
M; X67 is L or F; X72 is D or N; X75 is A or V; X76 is A or T; X78 is E or D; X79 is Q or E;
X80 is S, R, T, or N; X83 is E or D; X85 is F or L; X86 is S or Y; X88 is E or G; X90 is Y, H,
or N; X95 is D, E, or N; X101 is I, M, or V; X103 is E or G; X105 is G or W; X106 is V or M;
X107 is E, G, or K; X108 is E or G; X114 is V, E, or G; X116 is S or P; X121 is K or R; X m is
F or L; X132 is T, I, or M; X134 is K or R; and X140 is A or S; or a fragment of said SEQ ID
NO:71. As defined above, a conservatively modified variation of the sequence of SEQ ID
NO:71 can include up to a total of about 8 amino acid deletions, insertions, or conservative
substitutions in the 166 amino acid polypeptide, excluding the positions designated X in
SEQ ID NO:71, which correspond to the amino acid explicitly defined.

As an example, if four conservative substitutions were localized in the 
subsequence corresponding to amino acids 141-166 of SEQ ID NO:71, examples of 

conservatively substituted variations of this subsequence,
WEVVR AEIMR SFSFS TNLQK RLRRKE, include:
WEVVR SEIMR SFSYS TNLQR RLRRKD and
WELVR AEIVR SFSFS TNLNK RLRKKE, and the like, where the conservative
substitutions are underlined. 
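
As an illustration of how such a variation can be checked, the following sketch lists the positions at which a variant differs from a reference subsequence and tests each change against six substitution groups. Two assumptions are made: the groups shown are the commonly used six-group scheme and are assumed, not confirmed, to match Table 2; and the reference string is one reading of residues 141-166 (with isoleucine at position 148) that should be checked against the sequence listing. All function names are hypothetical.

```python
# Six substitution groups; ASSUMED here to correspond to Table 2 of the source.
GROUPS = ["AST", "DE", "NQ", "RK", "ILMV", "FYW"]
GROUP_OF = {aa: index for index, group in enumerate(GROUPS) for aa in group}


def substitutions(reference, variant, first_position=1):
    """Return (position, reference_residue, variant_residue) for each difference."""
    if len(reference) != len(variant):
        raise ValueError("sequences must have the same length")
    return [
        (pos, ref_aa, var_aa)
        for pos, (ref_aa, var_aa) in enumerate(zip(reference, variant), start=first_position)
        if ref_aa != var_aa
    ]


def is_conservative(ref_aa, var_aa):
    """True when both residues belong to the same substitution group."""
    return (
        ref_aa in GROUP_OF
        and var_aa in GROUP_OF
        and GROUP_OF[ref_aa] == GROUP_OF[var_aa]
    )


# Reference reading of residues 141-166 (an assumption) and one variant from the text.
reference = "WEVVRAEIMRSFSFSTNLQKRLRRKE"
variant = "WELVRAEIVRSFSFSTNLNKRLRKKE"
changes = substitutions(reference, variant, first_position=141)
all_conservative = all(is_conservative(r, v) for _, r, v in changes)
```

With these inputs the variant differs from the reference at four positions (143, 149, 159 and 164), each within a single substitution group.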

A feature of the invention is an interferon homologue polypeptide
comprising at least about 20, usually at least about 25, typically at least about 30, 40, 50,
60, 70, 80, 90, or 100 contiguous amino acids of any one of SEQ ID NOS:36-71 or SEQ
ID NOS:79-85. In other embodiments, the polypeptide typically comprises at least about
100, 110, 120, 125, 130, 140, 150, 155, 158, 160, 163, 164, or 165 contiguous amino acids 
of any one of SEQ ID NOS:36-70 or SEQ ID NOS:79-85. 

In other embodiments, the interferon homologue polypeptide of the 
invention comprises an amino acid sequence comprising one or more of amino acid
residues (Tyr or Gln)34, Gly37, Phe38, Lys71, Ala76, Tyr90, Ile132, Arg134, Phe152,
Lys160, and Glu166, wherein the numbering of the amino acids corresponds to the
numbering of amino acids in the amino acid sequence of SEQ ID NO:36. In a preferred
embodiment, the interferon homologue polypeptide comprises an amino acid sequence
comprising at least 150, 155, or 166 contiguous amino acid residues of any one of SEQ ID
NOS:36-70, further comprising Lys160 and Glu166, wherein the numbering of the amino
acids corresponds to the numbering of amino acids in the amino acid sequence of SEQ ID
NO:36. Some such polypeptides also exhibit an antiproliferative activity of at least about
8.3x10^6 units/milligram in the human Daudi cell line-based assay, or an antiviral activity
of at least about 2.1x10^7 units/milligram (mg) in the human WISH cell/EMCV-based
assay.

DEFINING POLYPEPTIDES BY IMMUNOREACTIVITY

Because the polypeptides of the invention provide a variety of new 
polypeptide sequences as compared to other alpha interferon homologues, the 

polypeptides also provide new structural features which can be recognized, e.g., in
immunological assays. The generation of antisera which specifically bind the
polypeptides of the invention, as well as the polypeptides which are bound by such 
antisera, are features of the invention. 

The invention includes interferon-alpha homologue polypeptides that
specifically bind to or that are specifically immunoreactive with an antibody or antisera
generated against an immunogen comprising an amino acid sequence selected from one or
more of SEQ ID NO:36 to SEQ ID NO:70, SEQ ID NO:71, and SEQ ID NO:79 to SEQ
ID NO:85. To eliminate cross-reactivity with other interferon-alpha polypeptides, e.g.,
known interferon-alpha polypeptides, the antibody or antisera (or antiserum) is subtracted
with available known alpha interferons, such as those polypeptides encoded by nucleic
acids represented by GenBank accession numbers J00210 (alpha-D), J00207 (Alpha-A),
X02958 (Alpha-6), X02956 (Alpha-5), V00533 (alpha-H), V00542 (alpha-14), V00545
(IFN-1B), X03125 (alpha-8), X02957 (alpha-16), V00540 (alpha-21), X02955 (alpha-4b),
V00532 (alpha-C), X02960 (alpha-7), X02961 (alpha-10 pseudogene), R0067 (Gx-1),
101614, 101787, 107821, M12350 (alpha-F), and M38289, V00549 (alpha-2a), and 108313
(alpha-Con 1), or any other known interferon-alpha polypeptides (typically referred to as
the "control alpha interferon polypeptides"). Where the accession number corresponds to
a nucleic acid, a polypeptide encoded by the nucleic acid is generated and used for
antibody/antisera subtraction purposes. Where the nucleic acid corresponds to a non-coding
sequence, e.g., a pseudogene, an amino acid sequence which corresponds to the reading
frame of the nucleic acid is generated (e.g., synthetically), or is minimally modified to
include a start codon for recombinant production.

In one typical format, the immunoassay uses a polyclonal antiserum which 
was raised against one or more polypeptides comprising one or more of the amino acid 
sequences corresponding to one or more of: SEQ ID NO:36 to SEQ ID NO:70, SEQ ID 
NO:71, and SEQ ID NO:79 to SEQ ID NO:85, or a substantial subsequence thereof (i.e.,
at least about 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, or 98% or more of the full
length sequence provided). The full set of potential polypeptide immunogens derived
from one or more of SEQ ID NO:36 to SEQ ID NO:70, SEQ ID NO:71, and SEQ ID
NO:79 to SEQ ID NO:85 are collectively referred to below as "the immunogenic
polypeptides." The resulting antisera is optionally selected to have low cross-reactivity
against the control alpha interferon polypeptides and/or other known interferon
polypeptides and any such cross-reactivity is removed by immunoabsorption with one or
more of the control alpha interferon polypeptides, prior to use of the polyclonal antiserum 
in the immunoassay. 

In order to produce antisera for use in an immunoassay, one or more of the 

immunogenic polypeptides is produced and purified as described herein. For example,
recombinant protein may be produced in a mammalian cell line. An inbred strain of mice
(used in this assay because results are more reproducible due to the virtual genetic identity
of the mice) is immunized with the immunogenic polypeptide(s) in combination with a
standard adjuvant, such as Freund's adjuvant, and a standard mouse immunization protocol
(see Harlow and Lane (1988) Antibodies, A Laboratory Manual, Cold Spring Harbor
Publications, New York, for a standard description of antibody generation, immunoassay
formats and conditions that can be used to determine specific immunoreactivity).
Alternatively, one or more synthetic or recombinant polypeptides derived from the
sequences disclosed herein is conjugated to a carrier protein and used as an immunogen.

Polyclonal sera are collected and titered against the immunogenic 
polypeptide(s) in an immunoassay, for example, a solid phase immunoassay with one or 
more of the immunogenic polypeptides immobilized on a solid support. Polyclonal
antisera with a titer of 10^6 or greater are selected, pooled and subtracted with the control
alpha interferon polypeptides to produce subtracted pooled titered polyclonal antisera. 

The subtracted pooled titered polyclonal antisera are tested for cross-reactivity
against the control alpha interferon polypeptides. Preferably at least two of the
immunogenic alpha interferon polypeptides are used in this determination, preferably in
conjunction with at least two of the control alpha interferon polypeptides, to identify
antibodies which are specifically bound by the immunogenic polypeptide(s).

In this comparative assay, discriminatory binding conditions are determined 
for the subtracted titered polyclonal antisera which result in at least about a 5-10 fold 
higher signal to noise ratio for binding of the titered polyclonal antisera to the
immunogenic alpha interferons as compared to binding to the control alpha interferons.
That is, the stringency of the binding reaction is adjusted by the addition of non-specific
competitors such as albumin or non-fat dry milk, or by adjusting salt conditions,
temperature, or the like. These binding conditions are used in subsequent assays for
determining whether a test polypeptide is specifically bound by the pooled subtracted
polyclonal antisera. In particular, a test polypeptide which shows at least a 2-5x higher
signal to noise ratio than the control polypeptides under discriminatory binding conditions,
and at least about a 1/2 signal to noise ratio as compared to the immunogenic
polypeptide(s), shares substantial structural similarity or homology with the immunogenic
polypeptide as compared to known alpha interferons, and is, therefore, a polypeptide of the
invention.

In another example, immunoassays in the competitive binding format are 
used for detection of a test polypeptide. For example, as noted, cross-reacting antibodies 
are removed from the pooled antisera mixture by immunoabsorption with the control alpha 
interferon polypeptides. The immunogenic polypeptide(s) are then immobilized to a solid
support which is exposed to the subtracted pooled antisera. Test proteins are added to the
assay to compete for binding to the pooled subtracted antisera. The ability of the test
protein(s) to compete for binding to the pooled subtracted antisera as compared to the
immobilized protein(s) is compared to the ability of the immunogenic polypeptide(s)
added to the assay to compete for binding (the immunogenic polypeptides compete
effectively with the immobilized immunogenic polypeptides for binding to the pooled
antisera). The percent cross-reactivity for the test proteins is calculated, using standard
calculations. 

In a parallel assay, the ability of the control proteins to compete for binding 
to the pooled subtracted antisera is determined as compared to the ability of the 
immunogenic polypeptide(s) to compete for binding to the antisera. Again, the percent 

cross-reactivity for the control polypeptides is calculated, using standard calculations.
Where the percent cross-reactivity is at least 5-10x as high for the test polypeptides, the 
test polypeptides are said to specifically bind the pooled subtracted antisera. 
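
The text refers only to "standard calculations" for percent cross-reactivity. One common convention, assumed here for illustration, expresses it as the ratio of the concentration of immunogenic polypeptide to the concentration of competitor needed for 50% inhibition of binding. The function names, the choice of the lower bound (5x) of the 5-10x range, and all numeric values below are hypothetical.

```python
def percent_cross_reactivity(ic50_immunogen, ic50_competitor):
    """Percent cross-reactivity under the assumed IC50-ratio convention."""
    if ic50_immunogen <= 0 or ic50_competitor <= 0:
        raise ValueError("IC50 values must be positive")
    return 100.0 * ic50_immunogen / ic50_competitor


def specifically_binds(test_cr, control_cr, fold=5.0):
    """True when the test cross-reactivity is at least `fold` times the control's."""
    return test_cr >= fold * control_cr


# Hypothetical IC50 values in arbitrary concentration units.
test_cr = percent_cross_reactivity(ic50_immunogen=2.0, ic50_competitor=4.0)
control_cr = percent_cross_reactivity(ic50_immunogen=2.0, ic50_competitor=100.0)
binds = specifically_binds(test_cr, control_cr)
```

A stricter reading of the 5-10x criterion can be applied by passing fold=10.0.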

In general, the immunoabsorbed and pooled antisera can be used in a 
competitive binding immunoassay as described herein to compare any test polypeptide to 

the immunogenic polypeptide(s). In order to make this comparison, the two polypeptides
are each assayed at a wide range of concentrations and the amount of each polypeptide 
required to inhibit 50% of the binding of the subtracted antisera to the immobilized protein 
is determined using standard techniques. If the amount of the test polypeptide required is 
less than twice the amount of the immunogenic polypeptide that is required, then the test 

polypeptide is said to specifically bind to an antibody generated to the immunogenic
polypeptide, provided the amount is at least about 5-10x as high as for a control 
polypeptide. 

As a final determination of specificity, the pooled antisera is optionally 
fully immunosorbed with the immunogenic polypeptide(s) (rather than the control 

polypeptides) until little or no binding of the resulting immunogenic polypeptide
subtracted pooled antisera to the immunogenic polypeptide(s) used in the 
immunoabsorption is detectable. This fully immunosorbed antisera is then tested for 
reactivity with the test polypeptide. If little or no reactivity is observed (i.e., no more than 
2x the signal to noise ratio observed for binding of the fully immunosorbed antisera to the 

immunogenic polypeptide), then the test polypeptide is specifically bound by the antisera
elicited by the immunogenic protein.

ANTIPROLIFERATIVE PROPERTIES OF INTERFERON HOMOLOGUES

The effect of interferon homologues on cellular growth was examined in a 
human Daudi cell line-based assay as described in Example 1. Fig. 2 shows the
antiproliferative activity of exemplary interferon homologues of the invention comprising
amino acid sequences SEQ ID NO:36 to SEQ ID NO:54, in comparison to control
interferons, human IFN-alpha 2a and consensus human IFN-alpha (Con1). The graph
shows the number of Units of activity per milligram (mg) of interferon test sample (Y 
axis) for a set of exemplary interferon alpha homologues, each of which is designated with 
a name (clone name) on the X axis, compared with that of human IFN-alpha 2a and 

consensus human IFN-alpha. These results indicate that compositions comprising an

interferon-alpha homologue of the present invention can be used in methods to inhibit or 
reduce proliferation of tumor cells, including, but not limited to: human carcinoma cells, 
hematopoietic cancer cells, human leukemia cells, human lymphoma cells, and human 
melanoma cells. Inhibition can be performed in vitro (useful, e.g., in a variety of 

proliferation assays), ex vivo or in vivo (useful, e.g., as a therapeutic or prophylactic
agent). 

Interferon-alpha homologues of the present invention show diverse activity 
patterns against a variety of cancer cell lines (see, e.g., Example 2). An in vitro cell line 
screen (as described in, e.g., Monks, A. et al. (1991) J. Nat'l Cancer Inst. 83:757-766) was 

used to assay interferon-alpha homologues of the invention for selective growth inhibition
and/or cell killing of particular cancer cell lines. The human cancer cell lines screened 
(see, e.g., Example 2, Table 3) include leukemias, melanomas, and cancers of the lung, 
colon, brain, central nervous system, ovary, breast, prostate, and kidney. 

Three activity parameters were determined in the cancer cell line screen: 1) 

GI50 ("growth inhibition at 50%"), a measure of growth inhibition activity, is the
concentration of interferon test sample (IFN alpha homologue or control IFN alpha) at
which cell growth is inhibited by 50%, as measured by a 50% reduction in the net
protein/polypeptide increase in the interferon test sample as compared to that observed in
the control cells (no test sample) at the end of the incubation period; 2) TGI ("total growth
inhibition"), a measure of cytostatic activity, is the concentration of interferon test sample
at which cell growth of a particular cell line is totally inhibited, wherein the amount of
cellular protein at the end of the incubation period equals the amount of cellular protein at
the beginning of the incubation period; and 3) LC50, a measure of cytotoxic activity, is
the concentration of interferon test sample at which a 50% reduction in the measured
amount of cellular protein at the end of the incubation as compared to that at the beginning
of the incubation period is observed, indicating a net loss of cells following interferon test
sample addition. Further details of the assay and data analysis procedures are provided in
Example 2. 
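
The three parameters defined above can be expressed through a single "percent growth" value computed from the protein measured at the start of incubation, in untreated control cells, and in treated cells. The sketch below is one way to write that arithmetic and is not taken from Example 2: GI50, TGI and LC50 then correspond to percent growth values of 50, 0 and -50. The function names, the log-linear interpolation between tested concentrations, and the data values are assumptions for illustration.

```python
import math


def percent_growth(t_zero, control, treated):
    """Percent growth from protein at time zero, in control cells, and in treated cells."""
    if treated >= t_zero:
        if control == t_zero:
            raise ValueError("control shows no net growth")
        # Net increase relative to the net increase of the untreated control.
        return 100.0 * (treated - t_zero) / (control - t_zero)
    # Net loss relative to the starting amount.
    return 100.0 * (treated - t_zero) / t_zero


def concentration_at(target, concentrations, growth):
    """Interpolate (on a log10 scale) the concentration where growth falls to target."""
    points = list(zip(concentrations, growth))
    for (c_lo, g_lo), (c_hi, g_hi) in zip(points, points[1:]):
        if g_lo >= target >= g_hi and g_lo != g_hi:
            frac = (g_lo - target) / (g_lo - g_hi)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10.0 ** log_c
    return None  # target not reached within the tested range


# Hypothetical dose-response data (ascending concentrations, arbitrary units).
concs = [1.0, 10.0, 100.0]
growth = [100.0, 0.0, -100.0]
gi50 = concentration_at(50.0, concs, growth)
tgi = concentration_at(0.0, concs, growth)
lc50 = concentration_at(-50.0, concs, growth)
```

A return value of None corresponds to a sample that shows no measurable activity of that type over the concentration range tested.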

The activity parameters of exemplary interferon-alpha homologue 3DA11
(SEQ ID NO:40) against a variety of cancer cell lines are shown in Figs. 3A, 3B, and 3C,
in comparison with the interferon-alpha Con1 and human interferon-alpha 2a controls.

With respect to growth inhibition activity, in particular, homologue 3DA11
and control interferon-alpha Con1 showed significant activity against most of the cell lines
tested, with the interferon-alpha Con1 exhibiting generally higher activity, and
interferon-alpha 2a generally exhibiting lower overall activity and in only a subset of the cell lines
(Fig. 3A).

In contrast, in particular, a pronounced difference was observed in the
cytotoxic and cytostatic activities of homologue 3DA11 in comparison to both
interferon-Con1 and human interferon-alpha 2a controls. In the concentration range tested,
homologue 3DA11 showed significant cytostatic activity against a population of cells of
eleven of the cell lines, while interferon-Con1 showed activity against only a population
of cells of one of the cell lines, against which homologue 3DA11 was also active (Fig.
3B). IFN-alpha 2a, on the other hand, was not active in this assay against any of the
tested cell lines. Homologue 3DA11 thus has a broader cytostatic activity profile than
consensus human interferon-alpha (Con1) and human interferon-alpha 2a.

Homologue 3DA11 also showed significant cytotoxic activity in
comparison to the interferon-Con1 and human interferon-alpha 2a controls (Fig. 3C).
Surprisingly, homologue 3DA11 displayed cytotoxic activity against a population of cells
of 8 of the cell lines, whereas neither the interferon-Con1 nor the interferon-alpha 2a
controls exhibited measurable activity against a population of cells of any of the cell lines
at the concentration range employed in the assay. Thus, homologue 3DA11 also has a
broader cytotoxic activity profile than interferon-Con1 and human interferon-alpha 2a.

Figs. 4A-4D illustrate the cytostatic activity (as reflected by the TGI value)
of exemplary interferon-alpha homologues of the invention. In each figure, the relative
cytostatic activity (expressed as -log TGI) against a population of cells of a particular
cancer cell line is plotted for various interferon-alpha homologues and for the two control
interferons (interferon-Con1 and human interferon-alpha 2a).

Of the exemplary homologues tested, homologues 1D3 (SEQ ID NO:54)
and 3DA11 (SEQ ID NO:40), but neither of the control interferons, exhibited significant
cytostatic activity against a population of cells of leukemia cell line RPMI-8226 over the
concentration range of the assay (Fig. 4A). In this example, the 1D3 and 3DA11
homologues showed at least about 25-fold higher cytostatic activity against a population
of the cells (corresponding to a difference in TGI of at least about 1.4 log units) than did
either of the controls (interferon-Con1 or interferon-alpha 2a) against a population of cells
of the leukemia cell line.

Homologues 1D3, 2G5 (SEQ ID NO:45), 6CG3 (SEQ ID NO:52) and
3DA11, but neither of the control interferons, exhibited significant cytostatic activity
against lung cancer cell line NCI-H23 (Fig. 4B). In this example, the 1D3, 2G5, 6CG3,
and 3DA11 homologues showed at least about 12-fold higher cytostatic activity against a
population of cells of the lung cancer cell line (corresponding to a difference in TGI of at
least about 1.1 log units) than either interferon-Con1 or interferon-alpha 2a against a
population of cells of the lung cancer cell line.

Homologues 1D3, 2G5, and 3DA11, but neither of the control interferons,
showed significant cytostatic activity against a population of cells of renal cancer cell line
ACHN (Fig. 4C). In this example, the 1D3, 2G5, and 3DA11 homologues showed at least
about 35-fold higher cytostatic activity against a population of cells of said renal cancer
cell line (corresponding to a difference in TGI of at least about 1.55 log units) than either
interferon-Con1 or interferon-alpha 2a against a population of cells of the renal cancer cell
line.

Homologues 1D3, 2G5, 3DA11, 2CA5 (SEQ ID NO:42) and 2DB11 (SEQ
ID NO:41), and the interferon-Con1 control, but not the interferon alpha-2a control,
exhibited significant cytostatic activity against a population of cells of an ovarian cancer
cell line OVCAR-3 (Fig. 4D). In this example, homologue 1D3 showed at least about
2-fold higher cytostatic activity (corresponding to a difference in TGI of at least about 0.3
log units) than interferon-Con1, and the 1D3, 2G5, 3DA11, 2CA5, and 2DB11
homologues showed at least about 40-fold higher cytostatic activity (corresponding to a
difference in TGI of at least about 1.6 log units) than interferon-alpha 2a, against
respective populations of cells of the ovarian cancer cell line.
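
The fold differences and log-unit differences quoted for Figs. 4A-4D are related by a base-10 logarithm: a difference of d log units in TGI corresponds to a 10^d-fold difference in the concentration giving total growth inhibition. The Python sketch below checks that arithmetic against the values stated above. The function names are illustrative only, and reading "log units" as base-10 is an assumption, albeit one consistent with every stated pair.

```python
import math


def fold_from_log_units(delta_log: float) -> float:
    """Fold difference implied by a difference of `delta_log` log10 units."""
    return 10.0 ** delta_log


def log_units_from_fold(fold: float) -> float:
    """Difference in log10 units implied by a given fold difference."""
    return math.log10(fold)


# (comparison, stated fold difference, stated TGI difference in log units),
# copied from the description of Figs. 4A-4D.
STATED = [
    ("Fig. 4A, RPMI-8226, vs. either control", 25, 1.4),
    ("Fig. 4B, NCI-H23, vs. either control", 12, 1.1),
    ("Fig. 4C, ACHN, vs. either control", 35, 1.55),
    ("Fig. 4D, OVCAR-3, 1D3 vs. interferon-Con1", 2, 0.3),
    ("Fig. 4D, OVCAR-3, vs. interferon-alpha 2a", 40, 1.6),
]

for label, fold, delta_log in STATED:
    implied = fold_from_log_units(delta_log)
    print(f"{label}: {delta_log} log units implies about {implied:.1f}-fold "
          f"(stated: at least about {fold}-fold)")
```

Each stated fold value agrees with its log-unit value to within rounding, which is what "at least about" would lead one to expect.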

From the exemplary data provided herein, it is apparent that
interferon-alpha homologues of the invention showed a variety of cytostatic activity
profiles, which differed significantly from those of the interferon-alpha Con1 and
interferon alpha-2a.

The present invention includes an interferon-alpha homologue having
increased cytostatic activity relative to human interferon-alpha 2a or to consensus human
interferon-alpha, Con1. In various embodiments, the interferon-alpha homologue has at
least about 2-fold higher cytostatic activity against a population of cells of a cancer cell
line (i.e., has a TGI value at least about 2-fold lower) than does human interferon-alpha 2a,
or has at least 2-fold higher cytostatic activity than interferon-Con1, against a population
of cells of one or more cancer cell lines selected from the following: a leukemia cell line; a
melanoma cell line; a lung cancer cell line; a colon cancer cell line; a central nervous
system (CNS) cancer cell line; an ovarian cancer cell line; a breast cancer cell line; a
prostate cancer cell line; and a renal cancer cell line.

In other embodiments, the interferon-alpha homologue has at least about
5-fold higher cytostatic activity against a population of cells of a cancer cell line (i.e., has
a TGI value at least about 5-fold lower) than does human interferon-alpha 2a, or has at least
about 5-fold higher cytostatic activity than interferon-Con1, against a population of cells
of one or more cancer cell lines selected from the following: a leukemia cell line; a
melanoma cell line; a lung cancer cell line; a colon cancer cell line; a central nervous
system (CNS) cancer cell line; an ovarian cancer cell line; a breast cancer cell line; a
prostate cancer cell line; and a renal cancer cell line. In other embodiments, the
interferon-alpha homologue has at least about 10-fold higher cytostatic activity against a
population of cells of a cancer cell line (i.e., has a TGI value at least about 10-fold lower)
than does human interferon-alpha 2a, or has at least about 10-fold higher cytostatic
activity than interferon-Con1, against a population of cells of one or more cancer cell lines
selected from the following: a leukemia cell line; a melanoma cell line; a lung cancer cell
line; a colon cancer cell line; a CNS cancer cell line; an ovarian cancer cell line; a breast
cancer cell line; a prostate cancer cell line; and a renal cancer cell line.

The invention includes an interferon-alpha homologue having increased
cytotoxic activity relative to human interferon-alpha 2a or relative to interferon-Con1. In
various embodiments, the interferon-alpha homologue has at least about 2-fold higher
cytotoxic activity (i.e., has an LC50 value at least about 2-fold lower), at least 5-fold
higher cytotoxic activity, or at least 10-fold higher cytotoxic activity, than human
interferon-alpha 2a against a population of cells of one or more cancer cell lines selected
from the following: a leukemia cell line; a melanoma cell line; a lung cancer cell line; a
colon cancer cell line; a CNS cancer cell line; an ovarian cancer cell line; a breast cancer
cell line; a prostate cancer cell line; and a renal cancer cell line. In other embodiments, the
interferon-alpha homologue has at least about 2-fold higher cytotoxic activity (i.e., has an
LC50 value at least about 2-fold lower), at least about 5-fold higher cytotoxic activity, or
at least about 10-fold higher cytotoxic activity, than interferon-Con1, against a population
of cells of at least one cancer cell line selected from: a leukemia cell line; a melanoma cell
line; a lung cancer cell line; a colon cancer cell line; a CNS cancer cell line; an ovarian
cancer cell line; a breast cancer cell line; a prostate cancer cell line; and a renal cancer cell
line.

The invention includes an interferon-alpha homologue having increased
growth inhibition activity relative to human interferon-alpha 2a or to interferon-Con1. In
various embodiments, the interferon-alpha homologue has at least about 2-fold higher
growth inhibition activity (i.e., has a GI50 value at least about 2-fold lower), at least about
5-fold higher growth inhibition activity, or at least about 10-fold higher growth inhibition
activity, than human interferon-alpha 2a, against a population of cells of one or more
cancer cell lines selected from: a leukemia cell line; a melanoma cell line; a lung cancer
cell line; a colon cancer cell line; a CNS cancer cell line; an ovarian cancer cell line; a
breast cancer cell line; a prostate cancer cell line; and a renal cancer cell line. In other
embodiments, the interferon-alpha homologue has at least about 2-fold higher growth
inhibition activity (i.e., has a GI50 value at least about 2-fold lower), at least about 5-fold
higher growth inhibition activity, or at least about 10-fold higher growth inhibition
activity, than interferon-Con1, against at least one cancer cell line selected from the
following: a leukemia cell line; a melanoma cell line; a lung cancer cell line; a colon
cancer cell line; a CNS cancer cell line; an ovarian cancer cell line; a breast cancer cell
line; a prostate cancer cell line; and a renal cancer cell line.
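
The fold thresholds recited in the preceding embodiments can be read as ratios of endpoint concentrations: because a lower TGI, LC50, or GI50 value means higher potency, an n-fold higher activity corresponds to a control value that is n times the homologue value. The following Python sketch illustrates that comparison. The helper names and the numeric inputs are hypothetical and are not assay data from this disclosure.

```python
# Fold thresholds recited in the embodiments (2-fold, 5-fold, 10-fold).
THRESHOLDS = (2.0, 5.0, 10.0)


def fold_improvement(control_value: float, homologue_value: float) -> float:
    """Ratio of control to homologue endpoint (TGI, LC50, or GI50).

    Both values must be in the same concentration units. A result above 1
    means the homologue reaches the endpoint at a lower concentration.
    """
    if control_value <= 0 or homologue_value <= 0:
        raise ValueError("endpoint concentrations must be positive")
    return control_value / homologue_value


def thresholds_met(control_value: float, homologue_value: float) -> list:
    """Return the recited fold thresholds that the homologue meets or exceeds."""
    fold = fold_improvement(control_value, homologue_value)
    return [t for t in THRESHOLDS if fold >= t]


# Hypothetical endpoint values in arbitrary but identical units.
met = thresholds_met(control_value=120.0, homologue_value=20.0)
print(f"fold improvement: {fold_improvement(120.0, 20.0):.1f}, thresholds met: {met}")
```

The same ratio applies to each of the three endpoints; only the endpoint being compared changes.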

The discovery set forth herein that interferons (such as the interferon-alpha
homologues described herein) can be evolved, modified, or recombined to display a
variety of activity profiles provides an opportunity for evolving and creating customized
and specific interferon homologues for the treatment of a variety of specific diseases or
disease conditions, including, e.g., a variety of cancers or related conditions. For example,
an interferon homologue of the invention optimized to have increased potency against a
particular target cancer cell type may also be optimized to have (advantageously) reduced
toxicity towards a non-target cell(s), and thus may produce lower side effects in the
subject to which the homologue is administered (e.g., patient).

The present invention further provides an opportunity to optimize
interferon homologues against tumor cells taken from a subpopulation of subjects (e.g.,
mammals or human patients), or even from an individual subject (e.g., mammal or human
patient), providing therapeutic or prophylactic treatment tailored to the individual subject.
Optimized interferon homologues of the invention may provide therapeutic or
prophylactic benefit against cancers or related conditions or other interferon-treatable
disorders or conditions which are otherwise unresponsive to currently-available interferons
or to other treatment regimes.

ANTIVIRAL PROPERTIES OF INTERFERON HOMOLOGUES 

The antiviral activity of interferon homologues of the present invention was
evaluated in a human WISH cell/EMCV assay as described in Example 1. Fig. 2 shows
the antiviral activity of exemplary interferon homologues of the invention comprising
amino acid sequences SEQ ID NO:36 to SEQ ID NO:54.

Improved in vitro antiviral activity of exemplary IFN-alpha homologues of
the invention has been shown to be maintained in vivo in a murine model system. Two
IFN-alpha homologues of the invention, designated CH2.2 and CH2.3 (SEQ ID NOS:84
and 85, respectively), were previously shown to have about 206,000-fold and 138,000-fold
improved antiviral activity, respectively, compared to human IFN-alpha 2a in a murine
cell-based assay, as well as significantly higher activity in the same assay as compared to
native murine interferons (Chang et al. (1999) Nature Biotechnol. 17:793-797). As
described in Example 3 below, Balb/c mice challenged with a lethal dose of vesicular
stomatitis virus (VSV) were administered varying doses of IFN-alpha homologues,
designated CH2.2 and CH2.3, native murine interferon Mu-IFN alpha 4, and human
IFN-alpha 2a. The high in vitro activity correlated well with the observed in vivo activity
(Fig. 5). The CH2.2 and CH2.3 homologues were fully effective in protecting mice from the
lethal viral challenge, while the same dosage of the native murine interferon was partially
effective and the human IFN-alpha 2a was completely ineffective. These results indicate
that compositions comprising interferon homologues of the present invention can be used
in methods to inhibit viral replication in subjects infected with viruses including, but not
limited to: human immunodeficiency virus (HIV), hepatitis C virus (HCV), herpes simplex
virus (HSV), and hepatitis B virus (HBV). Inhibition can be performed in vitro (useful,
e.g., in a variety of antiviral assays), ex vivo (useful, e.g., as a therapeutic or prophylactic
agent in ex vivo methods discussed herein), or in vivo (useful, e.g., as a therapeutic or
prophylactic agent in in vivo methods discussed herein).

INTERFERON HOMOLOGUES IN THE TREATMENT OF AUTOIMMUNE AND
OTHER IMMUNE-RELATED DISORDERS

Compositions of the present invention can be used to therapeutically or
prophylactically treat and thereby alleviate a variety of immune system-related disorders
characterized by hyper- or hypo-active immune system function or other features. Such
disorders include hyperallergenicity and autoimmune disorders, such as multiple sclerosis,
type I (insulin dependent) diabetes mellitus, lupus erythematosus, amyotrophic lateral
sclerosis, Crohn's disease, rheumatoid arthritis, stomatitis, asthma, allergies, psoriasis and
the like.

THERAPEUTIC AND PROPHYLACTIC COMPOSITIONS

Therapeutic or prophylactic compositions comprising one or more
interferon homologue polypeptides or nucleic acids of the invention are tested in
appropriate in vitro, ex vivo, and in vivo animal models of disease, to confirm efficacy,
tissue metabolism, and to estimate dosages, according to methods well known in the art.
In particular, dosages can be determined by activity comparison of the alpha interferon
homologues to existing alpha interferon therapeutics or prophylactics, i.e., in a relevant
assay. In one aspect, the invention provides methods comprising administering one or
more interferon homologue nucleotides or polypeptides of the invention (or fragments
thereof) described above to a mammal, including, e.g., a human, primate, mouse, pig, cow,
goat, rabbit, rat, guinea pig, hamster, horse, sheep; or a non-mammalian vertebrate such as
a bird (e.g., a chicken or duck) or a fish, or invertebrate, as described in greater detail
below. Such compositions typically comprise one or more interferon homologue
nucleotides or polypeptides of the invention (or fragments thereof) and an excipient,
including, e.g., a pharmaceutically acceptable excipient.

In one aspect, a composition of the invention is produced by digesting one
or more nucleic acids of the invention (or fragments thereof) with a restriction
endonuclease, an RNase, or a DNase.

In another aspect of the invention, compositions produced by incubating
one or more nucleic acids described above in the presence of deoxyribonucleotide
triphosphates and a nucleic acid polymerase, e.g., a thermostable polymerase, are
provided.

The invention also includes compositions comprising two or more nucleic
acids described above. The composition may comprise a library of nucleic acids, where
the library contains at least about 5, 10, 20, 50, 100, 150, or 200 or more such nucleic
acids.

Administration is by any of the routes normally used for introducing a
molecule into ultimate contact with blood or tissue cells. The interferon-alpha
homologues of the invention are administered in any suitable manner, preferably with
pharmaceutically acceptable carriers. Suitable methods of administering such interferon
homologues in the context of the present invention to a patient are available, and, although
more than one route can be used to administer a particular composition, a particular route
can often provide a more immediate and more effective reaction than another route.

Pharmaceutically acceptable carriers are determined in part by the 
particular composition being administered, as well as by the particular method used to 
administer the composition. Accordingly, there is a wide variety of suitable formulations 
of pharmaceutical compositions of the present invention. 

Polypeptide compositions can be administered for any of the prophylactic,
therapeutic, and diagnostic methods described herein by a number of routes including, but
not limited to oral, intravenous, intraperitoneal, intramuscular, transdermal, subcutaneous,
topical, sublingual, vaginal, or rectal means, or by inhalation. Interferon homologue
polypeptide compositions can also be administered via liposomes. Such administration
routes and appropriate formulations are generally known to those of skill in the art.

The interferon homologue polypeptide or nucleic acid, alone or in
combination with other suitable components, can also be made into aerosol formulations
(i.e., they can be "nebulized") to be administered via inhalation. Aerosol formulations can
be placed into pressurized acceptable propellants, such as dichlorodifluoromethane,
propane, nitrogen, and the like.

Formulations suitable for parenteral administration, such as, for example,
by intraarticular (in the joints), intravenous, intramuscular, intradermal, intraperitoneal,
and subcutaneous routes, include aqueous and non-aqueous, isotonic sterile injection
solutions, which can contain antioxidants, buffers, bacteriostats, and solutes that render the
formulation isotonic with the blood of the intended recipient, and aqueous and
non-aqueous sterile suspensions that can include suspending agents, solubilizers, thickening
agents, stabilizers, and preservatives. The formulations of packaged nucleic acid can be
presented in unit-dose or multi-dose sealed containers, such as ampules and vials.

Parenteral administration and intravenous administration are preferred
methods of administration. In particular, the routes of administration already in use for
existing alpha interferon therapeutics or prophylactics, along with formulations in current
use, are preferred routes of administration and formulation for the alpha interferon
homologue polypeptides and nucleic acids of the invention.

Cells transduced with the interferon homologue nucleic acids as described
above in the context of ex vivo or in vivo therapy can also be administered intravenously or
parenterally as described above. It will be appreciated that the delivery of cells to subjects
(e.g., human patients) is routine, e.g., delivery of cells to the blood via intravenous or
intraperitoneal administration.

The dose of interferon homologue polypeptide or nucleic acid of the
invention administered to a subject (e.g., patient), in the context of the present invention is
sufficient to effect a beneficial therapeutic or prophylactic response in the subject (e.g.,
patient) over time, or to inhibit infection by a pathogen, depending on the application. The
dose will be determined by the efficacy of the particular vector, or formulation, and the
activity of the interferon homologue employed and the condition of the patient, as well as
the body weight or surface area of the patient to be treated. The size of the dose also will be
determined by the existence, nature, and extent of any adverse side-effects that accompany
the administration of a particular vector, formulation, transduced cell type or the like in a
particular patient.

In the therapeutic and prophylactic treatment methods of the invention
described herein, an effective amount of an interferon-alpha nucleic acid (e.g., DNA or
mRNA) of the invention (e.g., nucleic acid dosage) will generally be in the range of, e.g.,
from about 0.05 microgram/kilogram (kg) to about 50 mg/kg, usually about 0.005-5
mg/kg. However, as will be understood, the effective amount of the nucleic acid (e.g.,
nucleic acid dosage) and/or polypeptide (e.g., polypeptide dosage) will vary in a manner
apparent to those of ordinary skill in the art according to a number of factors, including the
activity or potency of the polypeptide, the activity or potency of any nucleic acid construct
(e.g., vector, promoter, expression system) to be administered, the disease or condition
(e.g., particular cancer) to be treated, and the subject to which or whom the nucleic acid is
delivered.

For delivery of some polypeptides, e.g., by delivering nucleic acids
encoding such polypeptides, for example, adequate levels of translation and/or expression
are achieved with a nucleic acid dosage of, e.g., about 0.005 mg/kg to about 5 mg/kg.
Dosages for other polypeptides (and nucleic acids encoding them) having a known
biological activity can be readily determined by those of skill in the art according to the
factors noted above. Dosages used for other known interferon-alphas for particular
diseases provide guidelines for determining dosage and treatment regimen for a nucleic
acid or polypeptide of the invention. An effective amount of an interferon-alpha
homologue polypeptide may be in the range of from about 1 microgram to about 1
milligram, and more typically from about 1 microgram to about 100 micrograms.
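
The per-kilogram nucleic acid ranges above convert to total amounts by multiplying by body mass. The Python sketch below performs only that arithmetic, using the ranges quoted in this section; the 70 kg body mass is taken from the example patient mentioned in this description, and the function and constant names are illustrative. It is a unit-conversion sketch under those assumptions, not a dosing recommendation.

```python
MG_PER_MICROGRAM = 1e-3  # 1 microgram = 0.001 mg


def total_amount_mg(dose_mg_per_kg: float, body_mass_kg: float) -> float:
    """Total amount (mg) for a per-kilogram dose and a given body mass."""
    if dose_mg_per_kg < 0 or body_mass_kg <= 0:
        raise ValueError("dose must be non-negative and body mass positive")
    return dose_mg_per_kg * body_mass_kg


# Ranges quoted in the text, expressed in mg/kg.
GENERAL_RANGE = (0.05 * MG_PER_MICROGRAM, 50.0)  # about 0.05 microgram/kg to about 50 mg/kg
USUAL_RANGE = (0.005, 5.0)                       # about 0.005-5 mg/kg

body_mass_kg = 70.0  # example body mass

for name, (low, high) in (("general", GENERAL_RANGE), ("usual", USUAL_RANGE)):
    low_mg = total_amount_mg(low, body_mass_kg)
    high_mg = total_amount_mg(high, body_mass_kg)
    print(f"{name} range for {body_mass_kg:g} kg: {low_mg:g} mg to {high_mg:g} mg")
```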

A composition for use in therapeutic and prophylactic treatment methods of
the invention described herein may comprise, e.g., a concentration of an interferon-alpha
homologue nucleic acid (e.g., DNA or mRNA) of the invention of from about 0.1
microgram/milliliter (ml) to about 20 mg/ml and a pharmaceutically acceptable carrier
(e.g., aqueous carrier).
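
For a composition at a given concentration, the volume containing a chosen amount of nucleic acid is the amount divided by the concentration. The Python sketch below applies that relation at the two ends of the concentration range stated above. The 0.35 mg amount is a hypothetical input chosen for illustration, and the names are not part of the disclosure.

```python
def volume_ml(amount_mg: float, concentration_mg_per_ml: float) -> float:
    """Volume (ml) of composition that contains `amount_mg` of nucleic acid."""
    if amount_mg < 0 or concentration_mg_per_ml <= 0:
        raise ValueError("amount must be non-negative and concentration positive")
    return amount_mg / concentration_mg_per_ml


# Concentration range quoted in the text, expressed in mg/ml.
LOW_CONCENTRATION = 0.1e-3   # about 0.1 microgram/ml
HIGH_CONCENTRATION = 20.0    # about 20 mg/ml

amount_mg = 0.35  # hypothetical amount, for illustration only

print(f"at {HIGH_CONCENTRATION:g} mg/ml: {volume_ml(amount_mg, HIGH_CONCENTRATION):g} ml")
print(f"at {LOW_CONCENTRATION:g} mg/ml: {volume_ml(amount_mg, LOW_CONCENTRATION):g} ml")
```

Because the stated range spans more than five orders of magnitude, the corresponding volumes do as well, which is one reason the concentration would be chosen together with the route of administration.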

A composition for use in therapeutic and prophylactic treatment methods of
the invention described herein may comprise, e.g., a concentration of an interferon-alpha
homologue polypeptide of the invention in an amount as described above and herein and a
pharmaceutically acceptable carrier (e.g., aqueous carrier).

In determining the effective amount of the vector, cell type, or formulation
to be administered in the treatment or prophylaxis of cancers or viral diseases, the
physician evaluates circulating plasma levels, vector/cell/formulation/interferon
homologue toxicities, progression of the disease, and the production of
anti-vector/interferon homologue antibodies.

The dose administered, e.g., to a 70 kilogram patient will be in the range
equivalent to dosages of currently-used interferon-alpha therapeutic or prophylactic
proteins, and doses of vectors or cells which produce interferon homologue sequences are
calculated to yield an equivalent amount of interferon homologue nucleic acid or
expressed protein. The vectors of this invention can supplement treatment of cancers and
virally-mediated conditions by any known conventional therapy, including cytotoxic
agents, nucleotide analogues (e.g., when used for treatment of HIV infection), biologic
response modifiers, and the like.

For administration, interferon homologues and transduced cells of the
present invention can be administered at a rate determined by the LD-50 of the interferon
homologue polypeptide or nucleic acid, vector, or transduced cell type, and the
side-effects of the interferon homologue polypeptides or nucleic acids, vector or cell type at
various concentrations, as applied to the mass and overall health of the patient.
Administration can be accomplished via single or divided doses.

For introduction of recombinant alpha-interferon nucleic acid transduced
cells into a subject (e.g., patient), blood samples are obtained prior to infusion, and saved
for analysis. Between 1 x 10^6 and 1 x 10^12 transduced cells are infused intravenously
over 60-200 minutes. Vital signs and oxygen saturation by pulse oximetry are closely
monitored. Blood samples are obtained 5 minutes and 1 hour following infusion and
saved for subsequent analysis. Leukopheresis, transduction and reinfusion are optionally
repeated every 2 to 3 months for a total of 4 to 6 treatments in a one year period. After the
first treatment, infusions can be performed on an outpatient basis at the discretion of the
clinician. If the reinfusion is given as an outpatient, the participant is monitored for at
least 4, and preferably 8 hours following the therapy. Transduced cells are prepared for
reinfusion according to established methods. See Abrahamsen et al. (1991) J. Clin.
Apheresis 6:48-53; Carter et al. (1988) J. Clin. Apheresis 4:113-117; Aebersold et al.
(1988) J. Immunol. Methods 112:1-7; Muul et al. (1987) J. Immunol. Methods 101:171-181
and Carter et al. (1987) Transfusion 27:362-365. After a period of about 2-4 weeks in
culture, the cells should number between 1 x 10^6 and 1 x 10^12. In this regard, the growth

characteristics of cells vary from patient to patient and from cell type to cell type. About 
72 hours prior to reinfusion of the transduced cells, an aliquot is taken for analysis of 
phenotype, and percentage of cells expressing the therapeutic or prophylactic agent. 
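
The infusion parameters given above (a cell number between 1 x 10^6 and 1 x 10^12, infused over 60 to 200 minutes) bound the average delivery rate, which is the cell number divided by the infusion time. The Python sketch below computes those bounds from the stated ranges only; the function name is illustrative, and a constant rate over the infusion is an assumption made for simplicity.

```python
def mean_infusion_rate(cells: float, minutes: float) -> float:
    """Average number of cells delivered per minute, assuming a constant rate."""
    if cells < 0 or minutes <= 0:
        raise ValueError("cell count must be non-negative and duration positive")
    return cells / minutes


CELL_RANGE = (1e6, 1e12)        # transduced cells per infusion, as stated
MINUTES_RANGE = (60.0, 200.0)   # infusion duration in minutes, as stated

# Slowest case: fewest cells over the longest time; fastest case: the reverse.
slowest = mean_infusion_rate(CELL_RANGE[0], MINUTES_RANGE[1])
fastest = mean_infusion_rate(CELL_RANGE[1], MINUTES_RANGE[0])

print(f"average rate between {slowest:.3g} and {fastest:.3g} cells per minute")
```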

If a subject (e.g., patient) undergoing infusion of a vector or transduced cell
or protein formulation develops fevers, chills, or muscle aches, he/she receives the
appropriate dose of aspirin, ibuprofen, acetaminophen or other pain/fever controlling drug.
Subjects (e.g., patients) who experience reactions to the infusion such as fever, muscle
aches, and chills are premedicated 30 minutes prior to the future infusions with either
aspirin, acetaminophen, or, e.g., diphenhydramine. Meperidine is used for more severe
chills and muscle aches that do not quickly respond to antipyretics and antihistamines.
Cell infusion is slowed or discontinued depending upon the severity of the reaction.

THERAPEUTIC AND PROPHYLACTIC TREATMENT METHODS 

The present invention also includes methods of therapeutically or
prophylactically treating a disease or disorder by administering in vivo or ex vivo one or
more nucleic acids or polypeptides of the invention described above (or compositions
comprising a pharmaceutically acceptable excipient and one or more such nucleic acids or
polypeptides) to a subject, including, e.g., a mammal, including, e.g., a human, primate,
mouse, pig, cow, goat, rabbit, rat, guinea pig, hamster, horse, sheep; or a non-mammalian
vertebrate such as a bird (e.g., a chicken or duck) or a fish, or invertebrate.

In one aspect of the invention, in ex vivo methods, one or more cells or a
population of cells of interest of the subject (e.g., tumor cells, tumor tissue sample, organ
cells, blood cells, cells of the skin, lung, heart, muscle, brain, mucosae, liver, intestine,
spleen, stomach, lymphatic system, cervix, vagina, prostate, mouth, tongue, etc.) are
obtained or removed from the subject and contacted with an amount of a polypeptide of
the invention that is effective in prophylactically or therapeutically treating the disease,
disorder, or other condition. The contacted cells are then returned or delivered to the
subject to the site from which they were obtained or to another site (e.g., including those
defined above) of interest in the subject to be treated. If desired, the contacted cells may
be grafted onto a tissue, organ, or system site (including all described above) of interest in
the subject using standard and well-known grafting techniques or, e.g., delivered to the
blood or lymph system using standard delivery or transfusion techniques.

The invention also provides in vivo methods in which one or more cells or a
population of cells of interest of the subject are contacted directly or indirectly with an
amount of a polypeptide of the invention effective in prophylactically or therapeutically
treating the disease, disorder, or other condition. In direct contact/administration formats,
the polypeptide is typically administered or transferred directly to the cells to be treated or
to the tissue site of interest (e.g., tumor cells, tumor tissue sample, organ cells, blood cells,
cells of the skin, lung, heart, muscle, brain, mucosae, liver, intestine, spleen, stomach,
lymphatic system, cervix, vagina, prostate, mouth, tongue, etc.) by any of a variety of
formats, including topical administration, injection (e.g., by using a needle or syringe), or
vaccine or gene gun delivery, pushing into a tissue, organ, or skin site. The polypeptide
can be delivered, for example, intramuscularly, intradermally, subdermally,
subcutaneously, orally, intraperitoneally, intrathecally, intravenously, or placed within a
cavity of the body (including, e.g., during surgery), or by inhalation or vaginal or rectal
administration.

In in vivo indirect contact/administration formats, the polypeptide is
typically administered or transferred indirectly to the cells to be treated or to the tissue site
of interest, including those described above (such as, e.g., skin cells, organ systems,
lymphatic system, or blood cell system, etc.), by contacting or administering the
polypeptide of the invention directly to one or more cells or population of cells from
which treatment can be facilitated. For example, tumor cells within the body of the
subject can be treated by contacting cells of the blood or lymphatic system, skin, or an
organ with a sufficient amount of the polypeptide such that delivery of the polypeptide to
the site of interest (e.g., tissue, organ, or cells of interest or blood or lymphatic system
within the body) occurs and effective prophylactic or therapeutic treatment results. Such
contact, administration, or transfer is typically made by using one or more of the routes or
modes of administration described above.

In another aspect, the invention provides ex vivo methods in which one or
more cells of interest or a population of cells of interest of the subject (e.g., tumor cells,
tumor tissue sample, organ cells, blood cells, cells of the skin, lung, heart, muscle, brain,
mucosae, liver, intestine, spleen, stomach, lymphatic system, cervix, vagina, prostate,
mouth, tongue, etc.) are obtained or removed from the subject and transformed by
contacting said one or more cells or population of cells with a polynucleotide construct
comprising a target nucleic acid sequence of the invention that encodes a biologically
active polypeptide of interest (e.g., a polypeptide of the invention) that is effective in
prophylactically or therapeutically treating the disease, disorder, or other condition. The
one or more cells or population of cells is contacted with a sufficient amount of the
polynucleotide construct and a promoter controlling expression of said nucleic acid
sequence such that uptake of the polynucleotide construct (and promoter) into the cell(s)
occurs and sufficient expression of the target nucleic acid sequence of the invention results
to produce an amount of the biologically active polypeptide effective to prophylactically
or therapeutically treat the disease, disorder, or condition. The polynucleotide construct
may include a promoter sequence (e.g., CMV promoter sequence) that controls expression
of the nucleic acid sequence of the invention and/or, if desired, one or more additional
nucleotide sequences encoding at least one or more of another polypeptide of the
invention, a cytokine, adjuvant, or co-stimulatory molecule, or other polypeptide of
interest.

Following transfection, the transformed cells are returned, delivered, or
transferred to the subject to the tissue site or system from which they were obtained or to
another site (e.g., tumor cells, tumor tissue sample, organ cells, blood cells, cells of the
skin, lung, heart, muscle, brain, mucosae, liver, intestine, spleen, stomach, lymphatic
system, cervix, vagina, prostate, mouth, tongue, etc.) to be treated in the subject. If
desired, the cells may be grafted onto a tissue, skin, organ, or body system of interest in
the subject using standard and well-known grafting techniques or delivered to the blood or
lymphatic system using standard delivery or transfusion techniques. Such delivery,
administration, or transfer of transformed cells is typically made by using one or more of
the routes or modes of administration described above. Expression of the target nucleic
acid occurs naturally or can be induced (as described in greater detail below) and an
amount of the encoded polypeptide is expressed sufficient and effective to treat the disease
or condition at the site or tissue system.

In another aspect, the invention provides in vivo methods in which one or
more cells of interest or a population of cells of the subject (e.g., including those cells and
cell systems and subjects described above) are transformed in the body of the subject by
contacting the cell(s) or population of cells with (or administering or transferring to the
cell(s) or population of cells using one or more of the routes or modes of administration
described above) a polynucleotide construct comprising a nucleic acid sequence of the
invention that encodes a biologically active polypeptide of interest (e.g., a polypeptide of
the invention) that is effective in prophylactically or therapeutically treating the disease,
disorder, or other condition.
The polynucleotide construct can be directly administered or transferred to
cell(s) suffering from the disease or disorder (e.g., by direct contact using one or more of
the routes or modes of administration described above). Alternatively, the polynucleotide
construct can be indirectly administered or transferred to cell(s) suffering from the disease
or disorder by first directly contacting non-diseased cell(s) or other diseased cells using
one or more of the routes or modes of administration described above with a sufficient
amount of the polynucleotide construct comprising the nucleic acid sequence encoding the
biologically active polypeptide, and a promoter controlling expression of the nucleic acid
sequence, such that uptake of the polynucleotide construct (and promoter) into the cell(s)
occurs and sufficient expression of the nucleic acid sequence of the invention results to
produce an amount of the biologically active polypeptide effective to prophylactically or
therapeutically treat the disease or disorder, and whereby the polynucleotide construct or
the resulting expressed polypeptide is transferred naturally or automatically from the
initial delivery site, system, tissue or organ of the subject's body to the diseased site,
tissue, organ or system of the subject's body (e.g., via the blood or lymphatic system).
Expression of the target nucleic acid occurs naturally or can be induced (as described in
greater detail below) such that an amount of the encoded polypeptide is expressed
sufficient and effective to treat the disease or condition at the site or tissue system. The
polynucleotide construct may include a promoter sequence (e.g., CMV promoter
sequence) that controls expression of the nucleic acid sequence and/or, if desired, one or
more additional nucleotide sequences encoding at least one or more of another polypeptide
of the invention, a cytokine, adjuvant, or co-stimulatory molecule, or other polypeptide of
interest.

In each of the in vivo and ex vivo treatment methods as described above, a
composition comprising an excipient and the polypeptide or nucleic acid of the invention
can be administered or delivered. In one aspect, a composition comprising a
pharmaceutically acceptable excipient and a polypeptide or nucleic acid of the invention is
administered or delivered to the subject as described above in an amount effective to treat
the disease or disorder.

In another aspect, in each in vivo and ex vivo treatment method described
above, the amount of polynucleotide administered to the cell(s) or subject can be an
amount sufficient that uptake of said polynucleotide into one or more cells of the subject
occurs and sufficient expression of said nucleic acid sequence results to produce an
amount of a biologically active polypeptide effective to enhance an immune response in
the subject, including an immune response induced by an immunogen (e.g., antigen). In
another aspect, for each such method, the amount of polypeptide administered to cell(s) or
subject can be an amount sufficient to enhance an immune response in the subject,
including that induced by an immunogen (e.g., antigen).

In yet another aspect, in an in vivo or ex vivo treatment method in which a
polynucleotide construct (or composition comprising a polynucleotide construct) is used to
deliver a physiologically active polypeptide to a subject, the expression of the
polynucleotide construct can be induced by using an inducible on- and off-gene expression
system. Examples of such on- and off-gene expression systems include the Tet-On™
Gene Expression System and Tet-Off™ Gene Expression System, respectively (see, e.g.,
Clontech Catalog 2000, pp. 110-111, for a detailed description of each such system).
Other controllable or inducible on- and off-gene expression systems are known to those of
ordinary skill in the art. With such a system, expression of the target nucleic acid of the
polynucleotide construct can be regulated in a precise, reversible, and quantitative manner.
Gene expression of the target nucleic acid can be induced, for example, after the stably
transfected cells containing the polynucleotide construct comprising the target nucleic acid
are delivered or transferred to or made to contact the tissue site, organ or system of
interest. Such systems are of particular benefit in treatment methods and formats in which
it is advantageous to delay or precisely control expression of the target nucleic acid (e.g.,
to allow time for completion of surgery and/or healing following surgery; to allow time for
the polynucleotide construct comprising the target nucleic acid to reach the site, cells,
system, or tissue to be treated; to allow time for the graft containing cells transformed with
the construct to become incorporated into the tissue or organ onto or into which it has been
spliced or attached, etc.).

INTEGRATED SYSTEMS

The present invention provides computers, computer readable media and
integrated systems comprising character strings corresponding to the sequence information
herein for the polypeptides and nucleic acids herein, including, e.g., those sequences listed
herein and the various silent substitutions and conservative substitutions thereof.

Various methods and genetic algorithms (GOs) known in the art can be
used to detect homology or similarity between different character strings, or can be used to
perform other desirable functions such as to control output files, provide the basis for
making presentations of information including the sequences and the like. Examples
include BLAST, discussed supra.

Thus, different types of homology and similarity of various stringency and
length can be detected and recognized in the integrated systems herein. For example,
many homology determination methods have been designed for comparative analysis of
sequences of biopolymers, for spell-checking in word processing, and for data retrieval
from various databases. With an understanding of double-helix pair-wise complement
interactions among 4 principal nucleobases in natural polynucleotides, models that
simulate annealing of complementary homologous polynucleotide strings can also be used
as a foundation of sequence alignment or other operations typically performed on the
character strings corresponding to the sequences herein (e.g., word-processing
manipulations, construction of figures comprising sequence or subsequence character
strings, output tables, etc.). An example of a software package with GOs for calculating
sequence similarity or homology is BLAST, which can be adapted to the present invention
by inputting character strings corresponding to the sequences herein.
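As a minimal illustration of operating on sequences as character strings, the following Python sketch computes the reverse complement of a nucleotide string (the strand that would anneal to it) and the percent identity between two equal-length strings. It is a simple position-by-position comparison, not BLAST and not a gapped alignment; the function names are hypothetical and are introduced here only for illustration.

```python
# Hypothetical helpers for illustration; this is not BLAST and handles no gaps.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the complementary strand of `seq`, read 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def percent_identity(a: str, b: str) -> float:
    """Percent of positions at which two equal-length strings match."""
    if len(a) != len(b):
        raise ValueError("strings must have equal length (gaps are not handled)")
    if not a:
        raise ValueError("strings must be non-empty")
    matches = sum(1 for x, y in zip(a.upper(), b.upper()) if x == y)
    return 100.0 * matches / len(a)


# First 48 characters of SEQ ID NO:1 and SEQ ID NO:2 as listed herein.
seq1_prefix = "TGTGATCTGCCTCAGACCCACAGCCTTGGCAACAGGAGGGCCTTGATG"
seq2_prefix = "TGTGATCTGCCTCAGACCCACAGCCTTGGTGACAGGAGGGCCATGATA"

print(round(percent_identity(seq1_prefix, seq2_prefix), 1))
print(reverse_complement(seq1_prefix[:12]))
```

A real homology search would normally use a gapped, scored alignment; the equal-length restriction here is a deliberate simplification.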

Similarly, standard desktop applications such as word processing software
(e.g., Microsoft Word™ or Corel WordPerfect™) and database software (e.g., spreadsheet
software such as Microsoft Excel™, Corel Quattro Pro™, or database programs such as
Microsoft Access™ or Paradox™) can be adapted to the present invention by inputting a
character string corresponding to the interferon alpha homologues of the invention (either
nucleic acids or proteins, or both). For example, the integrated systems can include the
foregoing software having the appropriate character string information, e.g., used in
conjunction with a user interface (e.g., a GUI in a standard operating system such as a
Windows, Macintosh or LINUX system) to manipulate strings of characters. As noted,
specialized alignment programs such as BLAST can also be incorporated into the systems
of the invention for alignment of nucleic acids or proteins (or corresponding character
strings).

Integrated systems for analysis in the present invention typically include a
digital computer with GO software for aligning sequences, as well as data sets entered into
the software system comprising any of the sequences herein. The computer can be, e.g., a
PC (Intel x86 or Pentium chip-compatible DOS™, OS2™, WINDOWS™, WINDOWS
NT™, WINDOWS95™, WINDOWS98™, LINUX based machine, a MACINTOSH™,
Power PC, or a UNIX based (e.g., SUN™ work station) machine) or other commercially
common computer which is known to one of skill. Software for aligning or otherwise
manipulating sequences is available, or can easily be constructed by one of skill using a
standard programming language such as Visual Basic, Fortran, Basic, Java, or the like.

Any controller or computer optionally includes a monitor which is often a
cathode ray tube ("CRT") display, a flat panel display (e.g., active matrix liquid crystal
display, liquid crystal display), or others. Computer circuitry is often placed in a box
which includes numerous integrated circuit chips, such as a microprocessor, memory,
interface circuits, and others. The box also optionally includes a hard disk drive, a floppy
disk drive, a high capacity removable drive such as a writeable CD-ROM, and other
common peripheral elements. Inputting devices such as a keyboard or mouse optionally
provide for input from a user and for user selection of sequences to be compared or
otherwise manipulated in the relevant computer system.

The computer typically includes appropriate software for receiving user
instructions, either in the form of user input into a set of parameter fields, e.g., in a GUI, or in
the form of preprogrammed instructions, e.g., preprogrammed for a variety of different
specific operations. The software then converts these instructions to appropriate language
for instructing the operation of the fluid direction and transport controller to carry out the
desired operation.

The software can also include output elements for controlling nucleic acid
synthesis (e.g., based upon a sequence or an alignment of sequences herein) or other
operations which occur downstream from an alignment or other operation performed using
a character string corresponding to a sequence herein.

In one embodiment, the invention provides an integrated system
comprising a computer or computer readable medium comprising a database having one or
more sequence records. Each of the sequence records comprises one or more character
strings corresponding to a nucleic acid or polypeptide or protein sequence selected from
SEQ ID NO:1 to SEQ ID NO:85. The integrated system further comprises a user input
interface allowing a user to selectively view the one or more sequence records. In one such
integrated system, the computer or computer readable medium comprises an alignment
instruction set that aligns the character strings with one or more additional character
strings corresponding to a nucleic acid or polypeptide or protein sequence.
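One way such a database of sequence records and a selective viewing interface might be organized is sketched below in Python. This is only an illustrative in-memory structure under stated assumptions: the class names, field names, and the `view` method are hypothetical, and the sequence strings are truncated to the first 18 characters of the corresponding listed sequences for brevity.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List


@dataclass(frozen=True)
class SequenceRecord:
    """One sequence record: an identifier plus its character string."""
    seq_id: str      # e.g., "SEQ ID NO:1"
    clone_id: str    # e.g., "2DH12"
    kind: str        # "nucleic acid" or "polypeptide"
    sequence: str    # character string corresponding to the sequence


class SequenceDatabase:
    """In-memory stand-in for a database of sequence records (hypothetical)."""

    def __init__(self) -> None:
        self._records: Dict[str, SequenceRecord] = {}

    def add(self, record: SequenceRecord) -> None:
        self._records[record.seq_id] = record

    def list_ids(self) -> List[str]:
        """Identifiers available for selection, in insertion order."""
        return list(self._records)

    def view(self, selected_ids: Iterable[str]) -> List[SequenceRecord]:
        """Return only the records the user selected; unknown IDs are skipped."""
        return [self._records[i] for i in selected_ids if i in self._records]


db = SequenceDatabase()
# Truncated prefixes of the listed sequences, used here for brevity only.
db.add(SequenceRecord("SEQ ID NO:1", "2DH12", "nucleic acid", "TGTGATCTGCCTCAGACC"))
db.add(SequenceRecord("SEQ ID NO:2", "2CA3", "nucleic acid", "TGTGATCTGCCTCAGACC"))

for record in db.view(["SEQ ID NO:2"]):
    print(record.seq_id, record.clone_id, record.sequence)
```

An alignment instruction set could then be applied to the strings returned by `view`, together with any additional character string supplied by the user.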

One such integrated system includes an instruction set that comprises at
least one of the following: a local homology comparison determination, a homology
alignment determination, a search for similarity determination, and a BLAST
determination. In some embodiments, the system further comprises a readable output
element that displays an alignment produced by the alignment instruction set. In another
embodiment, the computer or computer readable medium further comprises an instruction
set that translates at least one nucleic acid sequence which comprises a sequence selected
from SEQ ID NO:1 to SEQ ID NO:35 or SEQ ID NO:72 to SEQ ID NO:78 into an amino
acid sequence. The instruction set may select the nucleic acid by applying a codon usage
instruction set or an instruction set which determines sequence identity to a test nucleic
acid sequence.
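A translation instruction set of the kind described above can be sketched as follows. The Python example below is a minimal sketch that assumes the standard genetic code and an in-frame coding sequence; the function name `translate` is hypothetical. The example input is the first 48 characters of SEQ ID NO:1 as listed herein.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleic_acid: str) -> str:
    """Translate an in-frame coding string into one-letter amino acid codes.

    Stop codons are rendered as '*'. RNA input (U) is accepted as T.
    """
    seq = nucleic_acid.upper().replace("U", "T")
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 48 characters (16 codons) of SEQ ID NO:1 as listed herein.
seq1_prefix = "TGTGATCTGCCTCAGACCCACAGCCTTGGCAACAGGAGGGCCTTGATG"
print(translate(seq1_prefix))
```

The sketch raises an error on characters outside A, C, G, T and U; a production instruction set would also need to handle ambiguity codes and alternative genetic codes.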

Methods of using a computer system to present information pertaining to at
least one of a plurality of sequence records stored in a database are also provided. Each of
the sequence records comprises at least one character string corresponding to SEQ ID
NO:1 to SEQ ID NO:85. The method comprises determining at least one character string
corresponding to one or more of SEQ ID NO:1 to SEQ ID NO:85 or a subsequence
thereof; determining which of the at least one character string of the list are selected by a
user; and displaying each of the selected character strings, or aligning each of the selected
character strings with an additional character string. The method may further comprise
displaying an alignment of each of the selected character strings with an additional
character string and/or displaying the list.

KITS

In an additional aspect, the present invention provides kits embodying the
methods, compositions, systems and apparatus herein. Kits of the invention optionally
comprise one or more of the following: (1) an apparatus, system, system component or
apparatus component as described herein; (2) instructions for practicing the methods
described herein, and/or for operating the apparatus or apparatus components herein
and/or for using the compositions herein; (3) one or more alpha interferon homologue
compositions (such as, e.g., compositions comprising at least one interferon alpha
homologue nucleic acid or polypeptide or fragment thereof, cell, vector, etc., of the
invention) or components (interferon alpha homologue nucleic acid or polypeptide or
fragment thereof, cell, vector, etc., of the invention); (4) a container for holding one or
more aspects of the invention, including such components or compositions; and (5)
packaging materials.

In a further aspect, the present invention provides for the use of any
apparatus, apparatus component, composition or kit herein, for the practice of any method
or assay herein, and/or for the use of any apparatus or kit to practice any assay or method
herein.

EXAMPLES 

EXAMPLE 1: PREPARATION AND SCREENING OF SHUFFLED INTERFERON-ALPHA LIBRARIES

Fragments (25-60 base pairs (bp) in length) of about 20 human interferon-alpha
subspecies genes were prepared by PCR amplification and DNase treatment, and
recombined essentially as described in Crameri A. et al. (1998; Nature 15:288-291), to
produce shuffled interferon-alpha mature coding sequences. Expression libraries were
prepared by subcloning shuffled interferon-alpha mature coding sequences into an E. coli
secretion vector. Shuffled interferon polypeptides were expressed as mature proteins
fused at the C-termini to an E tag (Amersham-Pharmacia) to facilitate quantitation and
purification from the periplasmic space. E. coli transformants were picked using a robotic
colony picker (Q-Bot, Genetix Pharmaceuticals) into microtiter plates, and periplasmic
extracts were prepared.

Periplasmic extracts were assayed for antiproliferative activity on a human
Daudi cell line as described by Scarozza, A.M. et al. (1992) J. Interferon Res. 12:35-42.

Clones exhibiting antiproliferative activity in the Daudi assay were re-screened
and expression levels determined by Western blot using an anti-E tag antibody
(Amersham-Pharmacia). Clones exhibiting highest activity normalized to expression
levels were selected for sequencing and were also utilized as substrates for additional
rounds of shuffling and screening as described above.

Clones from the first and second rounds of shuffling having relatively high
antiproliferative activity by the Daudi assay were subcloned into a CHO expression vector
(pDEI-1011) in which the E-tag/6-His tag (Amersham-Pharmacia) is fused to the
C-terminus of the shuffled interferons. Clones were transfected into CHO cells and stable
cell lines were selected with 1 mg/ml G418. CHO-expressed mature interferons were
purified on an anti-E tag Sepharose column (Amersham-Pharmacia) and quantitated by a
Bradford assay (Bio-Rad). CHO-purified shuffled interferons were assayed for
antiproliferative activity by the Daudi assay and for antiviral activity using a human WISH
cell/EMCV assay as described below.

Human WISH cell / EMCV antiviral assay

WISH cells were seeded to a density of 6 x 10^4 cells/well in 96-well plates
in 100 µl RPMI medium (Gibco-BRL) supplemented with 10% fetal calf serum, penicillin
(100 µg/ml), and streptomycin (100 µg/ml), and incubated for 24 hours at 37°C. Samples
of interferon-alpha polypeptides in medium (100 µl total volume) were added to wells and
incubated for 3 hours at 37°C under a 5% CO2 atmosphere. Dilutions of EMCV
(encephalomyocarditis virus) were added to wells in 50 µl volumes, and incubated for 24
hours as above. Medium was carefully removed and wells were rinsed 2x with warm
phosphate-buffered saline (PBS). Neutral red (100 µl/well of 1:50 dilution in medium)
was added to the wells and incubated for 2 hours as above. Glutaraldehyde (50 µl/well of
0.5% in PBS) was added and incubated for 30 minutes as above. Wells were washed 2x in
PBS, and 100 µl/well of a solution of 50% methanol, 1% acetic acid was added.
Absorbance at 540 nanometers (nm) was measured using a microplate reader.

Fig. 2 shows the antiproliferative activity and the antiviral activity of exemplary
interferon homologues of the invention, in comparison with interferon alpha-2a and
interferon-alpha Con1. The graph shows the number of Units of activity per milligram of
homologue (Y axis) for a set of exemplary interferon alpha homologues, each of which is
designated with a "name" on the X axis.

EXAMPLE 2: IN VITRO CANCER CELL LINE SCREEN

An in vitro cell line screen (as described in, e.g., Monks, A. et al. (1991) J. Nat'l
Cancer Inst. 83:757-766 (hereinafter "Monks") and
http://dtp.nci.gov/branches/btb/ivclsp.html, each of which is incorporated herein by
reference in its entirety for all purposes) was used to assay interferon-alpha homologues of
the invention for selective growth inhibition and/or cell killing of particular cancer cell
lines. The 60 human cancer cell lines used (Table 3) include leukemias, melanomas, and
cancers of the lung, colon, brain, ovary, breast, prostate, central nervous system, renal
system, and kidney. Human tumor cell lines were grown according to procedures outlined
in Monks and http://dtp.nci.gov/branches/btb/ivclsp.html.



Table 3
Human cancer cell lines screened

Cancer type       Cell lines
Leukemia          CCRF-CEM, HL-60 (TB), K-562, MOLT-4, RPMI-8226, SR
Colon cancer      COLO 205, HCC-2998, HCT-15, HCT-116, HT29, KM12, SW-620
CNS cancer        SF-268, SF-295, SF-539, SNB-19, SNB-75, U251
Lung cancer       A549/ATCC, EKVX, HOP-62, HOP-92, NCI-H23, NCI-H226, NCI-H322M,
                  NCI-H460, NCI-H522
Breast cancer     MCF-7, NCI/ADR, HS578T, MDA-MB-231/ATCC, MDA-MB-435, MDA-N,
                  BT-549, T-47D
Melanoma          LOX IMVI, M14, MALME-3M, SK-MEL-2, SK-MEL-5, SK-MEL-28,
                  UACC-62, UACC-257
Ovarian cancer    IGROV1, OVCAR-3, OVCAR-4, OVCAR-5, OVCAR-8, SK-OV-3
Prostate cancer   DU-145, PC-3
Renal cancer      786-0, A498, ACHN, CAKI-1, RXF 393, SN12C, TK-10, UO-31



Briefly, cells were inoculated into 96 well microtiter plates at densities
ranging from about 5,000 to about 40,000 cells/well, depending on the growth properties
of the particular cell line. After inoculation, the microtiter plates were incubated for 24
hours (h) at 37 degrees C prior to addition of test samples (e.g., interferon homologues of
the invention or control interferons). After 24 h, two plates of each cell line were fixed in
situ with trichloroacetic acid (TCA), to provide a measurement of the cell population for
each cell line at the time of test sample addition (T0). To the remaining plates, interferon
samples (affinity-purified from CHO cell supernatants) were added in five 10-fold serial
dilutions ranging from 10^-0.8 to 10^-4.8 µg/ml. Following sample addition, the plates
were incubated for an additional 6 days. The assay was terminated by addition of TCA.

Cell population was determined by measuring cellular protein in a
quantitative protein dye-binding assay. Sulforhodamine B solution (100 µl) at 0.4% (w/v)
in 1% acetic acid was added to each well, followed by incubation for 10 minutes at room
temperature. Unbound dye was removed by washing five times with 1% acetic acid and
the plates were air-dried. Protein-bound dye was solubilized with 10 millimolar (mM) Tris,
and the absorbance read at 515 nanometers (nm) on an automated plate reader.

Seven absorbance measurements were taken for each dose-response assay,
corresponding to: the amount of cellular protein prior to sample addition (time zero; T0),
the amount of cellular protein at the end of the incubation period in the absence of test
sample (control growth, C), and five measurements corresponding to the amount of
cellular protein at the end of the incubation period in the presence of each of the five
concentrations of interferon test sample (test growth in presence of interferon test sample
at the five concentration levels, Ti). These measurements were used to calculate the
following three parameters for each test sample:

GI50, or "growth inhibition of 50%," is the concentration of interferon test
sample at which cell growth is inhibited by 50%, as measured by a 50% reduction in the
net protein/polypeptide increase in the interferon test sample as compared to that observed
in the control cells (no test sample) at the end of the incubation period. GI50 is calculated
as the concentration of test sample where [(Ti - T0)/(C - T0)] x 100 = 50. See Fig. 3A.

TGI, or "total growth inhibition," is the concentration of interferon test
sample at which cell growth is totally inhibited, wherein the amount of cellular protein at
the end of the incubation period equals the amount of cellular protein at the beginning of
the incubation period. The concentration of interferon test sample that produces total
growth inhibition (TGI) is calculated as the concentration of test sample where Ti = T0.

LC50 is the concentration of interferon test sample at which a 50%
reduction in the measured amount of cellular protein at the end of the incubation as
compared to that at the beginning of the incubation period is observed, indicating a net
loss of cells following interferon test sample addition. LC50 is calculated as the
concentration of test sample where [(Ti - T0)/T0] x 100 = -50.






WO 01/25438 



PCT/US00/27781 



If, for a particular test sample, an effect was not achieved or was exceeded 
at the concentration range tested, the value for that parameter was expressed as greater or 
less than the maximum or minimum concentration tested. 
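The three parameters defined above can be computed from the seven absorbance measurements as sketched below in Python. This is a minimal sketch under stated assumptions: the text defines the target values (50, 0 and -50) but does not state how the concentration is located between tested doses, so linear interpolation on log10(concentration) is assumed here. The function names and the absorbance values in the usage example are hypothetical.

```python
import math
from typing import Dict, Optional, Sequence


def percent_growth(ti: float, t0: float, c: float) -> float:
    """Percent growth: relative to control when Ti >= T0, else relative to T0."""
    if ti >= t0:
        return 100.0 * (ti - t0) / (c - t0)
    return 100.0 * (ti - t0) / t0


def crossing_concentration(
    concs: Sequence[float], growth: Sequence[float], target: float
) -> Optional[float]:
    """Concentration at which growth first reaches `target`.

    Assumption: linear interpolation on log10(concentration) between
    adjacent tested doses. Returns None if the target is not reached
    within the tested range.
    """
    pairs = sorted(zip(concs, growth))
    for (c_lo, g_lo), (c_hi, g_hi) in zip(pairs, pairs[1:]):
        if g_lo != g_hi and (g_lo - target) * (g_hi - target) <= 0:
            frac = (target - g_lo) / (g_hi - g_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    return None


def dose_response_parameters(
    t0: float, c: float, ti_values: Sequence[float], concs: Sequence[float]
) -> Dict[str, Optional[float]]:
    """GI50, TGI and LC50 from T0, C and the five Ti measurements."""
    growth = [percent_growth(ti, t0, c) for ti in ti_values]
    return {
        "GI50": crossing_concentration(concs, growth, 50.0),
        "TGI": crossing_concentration(concs, growth, 0.0),
        "LC50": crossing_concentration(concs, growth, -50.0),
    }


# Hypothetical absorbance readings, for illustration only.
example = dose_response_parameters(
    t0=0.5,
    c=1.5,
    ti_values=[1.5, 1.25, 0.75, 0.5, 0.25],
    concs=[0.001, 0.01, 0.1, 1.0, 10.0],
)
print(example)
```

A returned value of None corresponds to the case described above in which the effect was not achieved within the range tested, so the parameter would be reported as greater than the maximum (or less than the minimum) concentration tested.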



EXAMPLE 3: IN VITRO ACTIVITY OF IFN-ALPHA HOMOLOGUES CORRELATES
WITH IN VIVO EFFICACY.

Fragments of human interferon-alpha genes were shuffled and screened for
activity in a murine cell-based antiviral assay as described by Chang et al. (1999) Nature
Biotechnol. 17:793-797. Interferon-alpha homologues that exhibited over 10^5-fold higher
antiviral activity than human interferon-alpha 2a against mouse cells were isolated. The
antiviral activities of a number of the interferon-alpha homologues even significantly
exceeded the antiviral activity of native mouse interferons, including Mu-IFN-alpha 4
(Chang et al., supra). Recursive sequence recombination (e.g., DNA shuffling) of human
interferon-alpha gene fragments to produce novel interferon alpha homologues and
subsequent screening of such homologues against murine interferon receptors resulted in
the identification and isolation of interferon-alpha homologues with activity optimized for
the distantly related murine species.

A dose-response study in mice was performed to determine if the high
antiviral activity observed in vitro is sustained in vivo. Two of the mouse-optimized
interferon-alpha homologues, designated herein as CH2.2 and CH2.3 (SEQ ID NOS:84
and 85, respectively), were used in this study. CH2.2 and CH2.3 were shown to have
about 138,000-fold and about 206,000-fold higher activity, respectively, than human
interferon-alpha 2a, and about 2.5-fold and about 1.6-fold higher activity than native
mouse interferon-alpha 4, in the in vitro mouse cell antiviral assay (Chang et al., supra).

Groups of Balb/c mice received either phosphate
buffered saline (PBS), interferon-alpha homologue CH2.2, interferon-alpha homologue
CH2.3, murine IFN-alpha 4, or human interferon-alpha 2a, in daily subcutaneous doses of
2, 10, or 50 µg (total volume of 50 µl) for four consecutive days. On day 2, the mice were
exposed to a lethal intranasal dose (ten times the LC50) of vesicular stomatitis virus
(VSV). Data are expressed as the number of mice which survive to day 21.

Fig. 5 shows that both of the mouse-optimized interferon-alpha
homologues, CH2.2 and CH2.3, were as effective or more effective than native murine
interferon Mu-IFN alpha 4 in protecting mice from VSV. At the concentrations tested,
human IFN-alpha 2a was nearly completely ineffective in protecting mice from the virus.
Thus, the in vivo efficacy of the interferon-alpha homologues of the invention correlates
remarkably well with the antiviral activities observed in the in vitro assays.
While the foregoing invention has been described in some detail for
purposes of clarity and understanding, it will be clear to one skilled in the art from a
reading of this disclosure that various changes in form and detail can be made without
departing from the true scope of the invention. For example, all the techniques, methods,
compositions, apparatus and systems described above may be used in various
combinations. All publications, patents, patent applications, or other documents cited in
this application are incorporated herein by reference in their entirety for all purposes to
the same extent as if each individual publication, patent, patent application, or other
document were individually indicated to be incorporated by reference for all purposes.

SEQUENCES 

SEQ ID


Clone ID


Sequence


SEQ ID NO:1


2DH12 


TGTGATCTGCCTCAGACCCACAGCCTTGGCAACAGGAGGGCCTTGATG 
CTCCTGGCACAAATGGGACGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACAAGACTTTGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGATTCATCTGCTGCTTGGGAACAGACC 
CTCCTAGAAAAATTTTCCACTGAACTCTACCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAGGTAGGGGTGAAAGAGACTCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAGGAAGTACTTCCAAAGAATCACT 
CTTTATCTAATAGAGAGGAAATACAGCCCTTGTGCATGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:2 


2CA3 


TGTGATCTGCCTCAGACCCACAGCCTTGGTGACAGGAGGGCCATGATA 
CTCCTGGCACAAATGGGACGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGATATGATTTCGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGATTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAGCTGAATGAACTG 
GAAGCATGTGTGATACAGGAGGTTGGGGTGGGAGAGACTCCCCTGATG 
AATGGGGACTCCATCCTGGCTGTGAAGAAGTACTTCCAAAGAATCACT
CTTTATCTAATAGAGAGGAAATACAGCCCTTGTGCATGGGAGGTTGTC
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:3 


4AB9 


TGTGATCTGCCTCAGACCCACAGCCTTGGCAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGACGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTCCCCCGGGAGGAGTTTGATGGCAACCAGTTC
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATGCAGCAGACC 
TTCAATCTCTTCAGCACAAAGAACTCATCTGCTGCTTGGGATGAGACC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATGAACTG 
GAAGCATGTGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAAGAAGTATAGCCCTTGTTCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA


SEQ ID NO:4 


2DA4 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATG 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACAAGACTTTGGATTCCCCCAGGAGGAGTTTGATAGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATGCAGCAGACC
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTGCTTGGGATGAGACC 
CTCCTAGAAAAATTTTCCACTGAACTCTACCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACCCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAGGAAGTACTTCCAAAGAATCACT 
CTTTATCTAATAGAGAGGAAATACAGCCCTTGTGCATGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:5


3DA11 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGGTA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGATATGATTTCGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGATTCATCTGCTGCTTGGGATGAGACC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACCCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTAATAGAGAGGAAATACAGCCCTTGTGCATGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:6 


2DB11 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATG 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGATATGATTTCGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGATTCATCTGCTGCTTGGGATGAGACC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAGCTGAATGACTTG 
GAAGCCTGTGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAGGAAGTACTTCCAAAGAATCACT 
CTTTATCTAATAGAGAGGAAATACAGCCCTTGTGCATGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:7 


2CA5 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGACGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACAAGACTTTGGATTCCCCCAGGAGGAGTTTGATGGCAACCGGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGAACTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTCTACCAGCAGCTGAATGACCTG
GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACCCCCCTGATG
AATGAGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTAATAGAGAGGAAATACAGCCCTTGTGCATGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:8


2G6 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTACTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTAACCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGTGGACCCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTCTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:9 


3AH7 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGCGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTCCCCCAGGAGGAGTTTGATAGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGATTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTCACCAGCAACTGAATGAACTG 
GAAGCATGTGTAGTACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAAGAAATACCTCCAAAGAATCACT 
CTTTATCTGACAGAGAAGAAGTATAGCCCTTGTGCATGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO: 10 


2G5 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATG 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACAAGACTTTGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGATTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTCTACCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACCCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAGGAAGTACTTCCAAAGAATCACT 
CTTTATCTAATAGAGAGGAAATACAGCCCTTGTGCATGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA
AGATTAAGGAGGAAGGAA


SEQ ID NO: 11 


2BA8 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCCTGATA 
CTCCTGGCACAAATGGGACGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGATATGATTTCGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGATTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACCCCCCTAATG 
AATGTGGACTCCATCCTGGCTGTGAGGAAGTACTTCCAAAGAATCACT 
CTTTATCTAATAGAGAGGAAATACAGCCCTTGTGCATGGGAGGTTGTC
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA


SEQ ID NO:12


1F3 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGGACAAATGGGAAGAATCTCTCATTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAACCTCTTCAGCACAAAGGACTCATCTGTTGCTTGGGATGAGAGG 
CTTCTAGACAAACTCTATACTGAACTTTACCAGCAGCTGAATGACCTG 
GAAGCCTGTGTGATGCAGGAGGTGTGGGTGGGAGGGACTCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAGAAAATACTTCCAAAGAATCACT 
CTCTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:13


4BE10 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAGATGGGACGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGATATGATTTCGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATAATGCAGCAGACC 
TTCAATCTCTTCAGCACAAAGAACTCATCTGCTGCTTGGGATGAGACC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATGAACTG 
GAAGCATGTGTGATACAGGGGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGAGGACTCCATCTTGGCTGTGAGGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAAGAAGTATAGCCCTTGTTCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:14


2DD9 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATG 
CTCCTGGCACAAATGGGAAGAATCTCCCCTTTCTCCTGCCTGAAGGAC 
AGATATGATTTCGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGATTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGGACTCTACCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACCCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAAGAAGTATAGCCCTTGTTCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:15


3CA1 


TGTGATCTGCCTCAGACCCACAGCCTTGGCAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTACCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGAACTCATCTGCTGCTTGGGATGAGACC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATAACCTG
GAAGCATGTGTGATACAGGAGGTTGGGATGGAAGAGACTCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAAGAAGTATAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO: 16 


2F8 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGACGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGATATGATTTCGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATGCAGCAGACC 
TTCAATCTCTTCAGCACAAAGAACTCATCTGCTGCTTGGGATGAGACC
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATGAACTG
GAAGCATGTGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAAGAAGTATAGCCCTTGTTCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:17


6CG3 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAAGAGGGCCATGATG 
CTCCTGGCACAAATGGGAAGAACCTCTCCTTTCTCCTGTCTGAAGGAC 
AGACATGACTTTGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAGGGCTCAAGCCATCTTTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATTTCTTCAGCACAAAGGACTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTAACCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAAGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:18


3CG7 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGTAGGGCCTTGATG 
CTCCTGGCACAAATGGGAAGAATCTCCCCTTTCTCCTGCCTGAAGGAC 
AGACATGATTTCGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGCCTTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGATTCATCTGCTGCTTGGGAACAGAAC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATAACCTG 
GAAGCATGTGTGATACAGGAGGTTGGGATGGAAGAGACTCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAGGAAGTACTTCCAAAGAATCACT 
CTTTATCTAATAGAGAGGAAATACAGCCCTTGTGCCTGGGAGGTTGTC
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:19


1D3 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA
CTCCTGGCACAAATGGGAAGAATCTCTCATTTCTCCTGCCTGAAGGAC 
AGACATGATTTCGGATTCCCCCAGGAGGAGTTTGATGGCCACCAGTTC 
CAGAAGACTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATGACCTG 
GAAGCATGTGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGATGGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO: 20 


2G4 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCATGATG
CTCCTGGCACAAATGAGCAGAATCTCTCCTTCCTCCTGTCTGATGGAC 
AGACATGACTTTGAATTTCCCCAGGAGGAATTTGATGATAAACAGTTC 
CAGAAGGCTCCAGCCATCTCTGTCCTCCATGAGGTGATTCAGCAGACC 
TTCAATCTCTTCAGCACAGAGGACTCATCTGCTGCTTGGGAACAGACC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATGACCTG 
GAAGCATGTGTGATGCAGGAGGAGAGGGTGGGAGAAACTCCCCTGATG 
AATGCGGACTCCATCTTGGCTGTGAGGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAAAGAAGAAGTATAGCCCTTGTTCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA
AGATTAAGGAGGAAGGAA


SEQ ID NO:21


1A1 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCATTTCTCCTGCCTGAAGGAC 
AGATATGATTTCGGATTCCCCCAGGAGGTGTTTGATGGCAACCAGTTC 
CAGAAGGCCCAAGCCATCTCTGCCTTCCATGAGATGATGCAGCAGACC 
TTCAATCTCTTCAGCACAGAGGACTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTCACCAGCAACTGAATGACCTG 
GAAGCCTGTGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAGGAAATACTTTCAAAGAATCACT 
CTTTATCTAATGGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO: 22 


1D10 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCATTTCTCCTGCCTGAAGGAC 
AGACATGATTTCGGATTCCCCCAGGAGGAGTTTGATGGCCACCAGTTC 
CAGAAGACTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATGACCTG
GAAGCATGTGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGATGGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:23 


1F6 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGACTTTGATG 
ATAATGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTTCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTACTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTAACCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAGGCTGGGGTGGAAGAGACTCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTAACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA
AGATTAAGGAGGAAGGAA 


SEQ ID NO:24 


2A10 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCATTTCTCCTGCCTGAAGGAC 
AGATATGATTTCGGATTCCCCCAGGAGGTGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGCCTTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTACTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATAACCTG 
GAAGCATGTGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG
AATGAGGACTCCATCCTGGCTGTGAGGAAATACTTTCAAAGAATCACT 
CTTTATCTGATGGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:25 


2C3 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTTCCTCAGGAGGAGTTTGATGGCAACCAGTCC
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC
TTCAATCTCTTCAGCACAAAGGACTCATCTGATACTTGGGATGCGACC 
CTTTTAGAAAAATTTTCCACTGAACTTAACCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACCCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:26 


2D1 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGACGAATCTCTCCTTTCTCCTGCCTGAAGGAC
AGACAAGACTTTGGATTCCCCCAGGAGGAGTTTGATGGCAACCGGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGAACTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTCTACCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACCCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTAATAGAGAGGAAATACAGCCCTTGTGCATGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA
AGATTAAGGAGGAAGGAA 


SEQ ID NO:27 


2D10


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAGTCTCTCCTTTCTCCTGCCTGAAGGAC
AGACATGACTTTGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGCCTTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTACTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATAACCTG 
GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAAGAAATACTTCCGAAGAATCACT 
CTCTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:28 


2D7 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGCGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGTCTGAAGGAC
AGACATGACTTCAGATTTCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTACTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATAACCTG 
GAAGCTTGCGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGTGGACTCTATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT
CTTTATCTGACAGAGAGGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:29 


2D9 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACT 
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTACTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTAACCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGGTG 
AATGTGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT
CTTTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO: 30 


2DA2 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGCCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACAGGACTTCGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATGCAGCAGACC 
TTCAATCTCTTCAGCACAAAGAACTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTCCACCAGCAACTGAATGAACTG 
GAAGCATGTGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTAATAGAGAGGAAATACAGCCCTTGTGCATGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:31


2DH9 


TGTGATCTGCCTCAGACCCACAGCCCTGGTAACAGGAGGGCCTTGATG 
CTCCTGGCACAAATGGGACGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGATATGATTTCGGATTCCCCCAGGGGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATGCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGATTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTCTACCGGCAGCTGAATGACCTG 
GAAGCCTGTGTGATACAGGAGGTTGGGGTGGAAGAGACCCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAGGAAGTACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAAGAAGCATAGCCCTTGTTCCTGGGAGGTTGTC
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:32 


2G11 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGACTTCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGACTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGACTCATCTGATACTTGGGAACAGAGC 
CTCCTAGAAAAATTCTACATTGAACTTTTCCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAGAAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGGAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:33 


2G12 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGACTTTGATG 
CTCATGGCACAAATGAGGAGAATCTCTCCTTTCCCCCGCCTGAAGGAC 
AGATATGATTTCGGATTCCCCCAGGAGGTGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCTATCTTCCTTTTCCATGAGATGATGCAGCAGACC 
TTCAATCTCTTCAGCACAAAGAACTCATCTGCTGCTTGGGATGAGACC 
CTCCTAGACAAATTCTACACTGAACTCTACCAGCAGCTGAATGACTTG 
GAAGCCTGTGTGATGCAGGAGGGGAGGGTGGGAGAAACTCCCCTGATG 
AATGCGGACTCCATCTTGGCTGTGAAGAAATACTTCCGAAGAATCACT 
CTCTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGCTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA


SEQ ID NO:34


2H9 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTACTTGGGAACAGAGC
CTCCTAGAAAAATTTTCCACTGAACTTAACCAGCAGCTGAATGACCTA 
GAAGCCTGTGTGACACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGAGGACTCTATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:35 


6BC11 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGATATGATTTCGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGCTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGATTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTAACCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAGGTTGGAGTGGAAGAGACTCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAGGAAATACAGCCCTTGTGCCTGGGAGGTTGTC
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:36 


2DH12 


CDLPQTHSLGNRRALMLLAQMGRISPFSCLKDRQDFGFPQEEFDGNQF
QKAQAISVLHEMIQQTFNLFSTKDSSAAWEQTLLEKFSTELYQQLNDL 
EACVIQEVGVKETPLMNVDSILAVRKYFQRITLYLIERKYSPCAWEVV 
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:37 


2CA3 


CDLPQTHSLGDRRAMILLAQMGRISPFSCLKDRYDFGFPQEEFDGNQF
QKAQAISVLHEMIQQTFNLFSTKDSSAAWEQSLLEKFSTELYQQLNEL
EACVIQEVGVGETPLMNGDSILAVKKYFQRITLYLIERKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:38 


4AB9 


CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRHDFGFPREEFDGNQF
QKAQAISVLHEMMQQTFNLFSTKNSSAAWDETLLEKFSTELYQQLNEL
EACVIQEVGVEETPLMNEDSILAVKKYFQRITLYLTEKKYSPCSWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:39 


2DA4 


CDLPQTHSLGNRRALMLLAQMGRISPFSCLKDRQDFGFPQEEFDSNQF 
QKAQAISVLHEMMQQTFNLFSTKDSSAAWDETLLEKFSTELYQQLNDL 
EACVIQEVGVEETPLMNVDSILAVRKYFQRITLYLIERKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:40 


3DA11 


CDLPQTHSLGNRRALVLLAQMGRISPFSCLKDRYDFGFPQEEFDGNQF 
QKAQAISVLHEMIQQTFNLFSTKDSSAAWDETLLEKFSTELYQQLNDL 
EACVIQEVGVEETPLMNEDSILAVKKYFQRITLYLIERKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:41 


2DB11


CDLPQTHSLGNRRALMLLAQMGRISPFSCLKDRYDFGFPQEEFDGNQF 
QKAQAISVLHEMIQQTFNLFSTKDSSAAWDETLLEKFSTELYQQLNDL 
EACVIQEVGVEETPLMNVDSILAVRKYFQRITLYLIERKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE


SEQ ID NO:42


2CA5 


CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRQDFGFPQEEFDGNRF 
QKAQAISVLHEMIQQTFNLFSTKNSSAAWEQSLLEKFSTELYQQLNDL
EACVIQEVGVEETPLMNEDSILAVKKYFQRITLYLIERKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:43 


2G6 


CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRHDFGFPQEEFDGNQF 
QKAQAISVLHEMIQQTFNLFSTKDSSATWEQSLLEKFSTELNQQLNDL
EACVIQEVGVEETPLMNVDPILAVKKYFQRITLYLTEKKYSPCAWEVV 
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:44 


3AH7 


CDLPQTHSLGNRRALILLAQMRRISPFSCLKDRHDFGFPQEEFDSNQF 
QKAQAISVLHEMIQQTFNLFSTKDSSAAWEQSLLEKFSTELHQQLNEL
EACVVQEVGVEETPLMNEDSILAVKKYLQRITLYLTEKKYSPCAWEVV 
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:45 


2G5 


CDLPQTHSLGNRRALMLLAQMGRISPFSCLKDRQDFGFPQEEFDGNQF 
QKAQAISVLHEMIQQTFNLFSTKDSSAAWEQSLLEKFSTELYQQLNDL
EACVIQEVGVEETPLMNVDSILAVRKYFQRITLYLIERKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:46 


2BA8 


CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRYDFGFPQEEFDGNQF 
QKAQAISVLHEMIQQTFNLFSTKDSSAAWEQSLLEKFSTELYQQLNDL
EACVIQEVGVEETPLMNVDSILAVRKYFQRITLYLIERKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:47 


1F3 


CDLPQTHSLGNRRALILLGQMGRISHFSCLKDRHDFGFPQEEFDGNQF 
QKAQAISVLHEMIQQTFNLFSTKDSSVAWDERLLDKLYTELYQQLNDL
EACVMQEVWVGGTPLMNEDSILAVRKYFQRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:48 


4BE10 


CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRYDFGFPQEEFDGNQF 
QKAQAISVLHEIMQQTFNLFSTKNSSAAWDETLLEKFSTELYQQLNEL
EACVIQGVGVEETPLMNEDSILAVRKYFQRITLYLTEKKYSPCSWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:49 


2DD9 


CDLPQTHSLGNRRALMLLAQMGRISPFSCLKDRYDFGFPQEEFDGNQF 
QKAQAISVLHEMIQQTFNLFSTKDSSAAWEQSLLEKFSTGLYQQLNDL
EACVIQEVGVEETPLMNEDSILAVKKYFQRITLYLTEKKYSPCSWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:50 


3CA1 


CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRHDFGLPQEEFDGNQF 
QKAQAISVLHEMIQQTFNLFSTKNSSAAWDETLLEKFSTELYQQLNNL
EACVIQEVGMEETPLMNVDSILAVKKYFQRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:51


2F8 


CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRYDFGFPQEEFDGNQF 
QKAQAISVLHEMMQQTFNLFSTKNSSAAWDETLLEKFSTELYQQLNEL
EACVIQEVGVEETPLMNEDSILAVKKYFQRITLYLTEKKYSPCSWEVV
RAEIMRSFSFSTNLQKRLRRKE


SEQ ID NO: 52


6CG3


CDLPQTHSLGNKRAMMLLAQMGRTSPFSCLKDRHDFGFPQEEFDGNQF
QRAQAIFVLHEMIQQTFNFFSTKDSSAAWEQSLLEKFSTELNQQLNDL
EACVIQEVGVEETPLMNEDSILAVKKYFQRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE


SEQ ID NO:53 


3CG7 


CDLPQTHSLGNSRALMLLAQMGRISPFSCLKDRHDFGFPQEEFDGNQF 
QKAQAISAFHEMIQQTFNLFSTKDSSAAWEQNLLEKFSTELYQQLNNL
EACVIQEVGMEETPLMNVDSILAVRKYFQRITLYLIERKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE


SEQ ID NO: 54 


1D3 


CDLPQTHSLGNRRALILLAQMGRISHFSCLKDRHDFGFPQEEFDGHQF
QKTQAISVLHEMIQQTFNLFSTKDSSAAWEQSLLEKFSTELYQQLNDL
EACVIQEVGVEETPLMNEDSILAVKKYFQRITLYLMEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:55 


2G4 


CDLPQTHSLGNRRAMMLLAQMSRISPSSCLMDRHDFEFPQEEFDDKQF
QKAPAISVLHEVIQQTFNLFSTEDSSAAWEQTLLEKFSTELYQQLNDL
EACVMQEERVGETPLMNADSILAVRKYFQRITLYLTKKKYSPCSWEVV
RAEIMRSFSFSTNLQKRLRRKE


SEQ ID NO: 56 


1A1 


CDLPQTHSLGNRRALILLAQMGRISHFSCLKDRYDFGFPQEVFDGNQF 
QKAQAISAFHEMMQQTFNLFSTEDSSAAWEQSLLEKFSTELHQQLNDL 
EACVIQEVGVEETPLMNEDSILAVRKYFQRITLYLMEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO: 57 


1D10 


CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRHDFRFPQEEFDGNQL 
QKTQAISVLHEMIQQTFNLFSTKDSSATWEQSLLEKFSTELNQQLNDL 
bAL ViyCaVCjrVEE TPPiXuNI VXJoXljAVKKYr yKX 1 X1YX1 1 bKKibrLAWhVV 
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:58 


1F6 


CDLPQTHSLGNRRTLMIMAQMGRISPFSCLKDRHDFGFPQEEFDGNQF 
QKAQAISVLHEMIQQTFNLFSTKDSSATWEQSLLEKFSTELNQQLNDL 
EACVIQEAGVEETPLMNVDSILAVKKYFQRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:59 


2A10 


CDLPQTHSLGNRRALILLAQMGRISHFSCLKDRYDFGFPQEVFDGNQF 
QKAQAISAFHEMIQQTFNLFSTKDSSATWEQSLLEKFSTELYQQLNNL 
EACVIQEVGVEETPLMNEDSILAVRKYFQRITLYLMEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:60 


2C3 


CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRHDFGFPQEEFDGNQS 
QKAQAISVLHEMIQQTFNLFSTKDSSDTWDATLLEKFSTELNQQLNDL 
EACVIQEVGVEETPLMNVDSILAVKKYFQRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:61 


2D1 


CDLPQTHSLGNRRALILLAQMRRISPFSCLKDRHDFGFPQEEFDGNQF 
QKAQAISAFHEMIQQTFNLFSTKDSSAAWEQSLLEKFSTELYQQLNNL 
EACVIQEVGMEETPLMNEDSILAVKKYFQRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:62 


2D10


CDLPQTHSLGNRRALILLAQMGRVSPFSCLKDRHDFGFPQEEFDGNQF 
QKAQAISAFHEMIQQTFNLFSTKDSSATWEQSLLEKFSTELYQQLNNL 
EACVIQEVGVEETPLMNVDSILAVKKYFRRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:63 


2D7 


CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRHDFRFPQEEFDGNQF 
QKAQAISVLHEMIQQTFNLFSTKDSSATWEQSLLEKFSTELYQQLNNL 
EACVIQEVGVEETPLMNVDSILAVKKYFQRITLYLTERKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:64


2D9


CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRHDFGFPQEEFDGNQF
QKAQAISVLHEMIQQTFNLFSTKDSSATWEQSLLEKFSTELNQQLNDL
EACVIQEVGVEETPLVNVDSILAVKKYFQRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:65 


2DA2 


CDLPQTHSLGNRRPLILLAQMGRISPFSCLKDRQDFGFPQEEFDGNQF 
QKAQAISVLHEMMQQTFNLFSTKNSSAAWEQSLLEKFSTELHQQLNEL 
EACVIQEVGVEETPLMNVDSILAVKKYFQRITLYLIERKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE


SEQ ID NO:66 


2DH9 


CDLPQTHSPGNRRALMLLAQMGRISPFSCLKDRYDFGFPQGEFDGNQF
QKAQAISVLHEMMQQTFNLFSTKDSSAAWEQSLLEKFSTELYRQLNDL
EACVIQEVGVEETPLMNVDSILAVRKYFQRITLYLTEKKHSPCSWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:67 


2G11 


CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRHDFGLPQEEFDGNQF 
QKTQAISVLHEMIQQTFNLFSTKDSSDTWEQSLLEKFYIELFQQLNDL 
EACVIQEVGVEETPLMNVDSILAVRKYFQRITLYLTEEKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:68 


2G12 


CDLPQTHSLGNRRTLMLMAQMRRISPFPRLKDRYDFGFPQEVFDGNQF 
QKAQAIFLFHEMMQQTFNLFSTKNSSAAWDETLLDKFYTELYQQLNDL
EACVMQEGRVGETPLMNADSILAVKKYFRRITLYLTEKKYSPCAWEAV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:69 


2H9 


CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRHDFGFPQEEFDGNQF
QKAQAISVLHEMIQQTFNLFSTKDSSATWEQSLLEKFSTELNQQLNDL
EACVTQEVGVEETPLMNEDSILAVKKYFQRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:70 


6BC11 


CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRYDFGFPQEEFDGNQL
QKAQAISVLHEMIQQTFNLFSTKDSSAAWEQSLLEKFSTELNQQLNDL
EACVIQEVGVEETPLMNVDSILAVKKYFQRITLYLTERKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:71


tl9bb 


CDLPQTHSLGXXRAXXLLXQMXRXSXFSCLKDRXDFGXPXEEFDXXXF 
QXXQAIXXXHEXXQQTFNXFSTKXSSXXWXXXLLXKXXTXLXQQLNXL 
EACVXQXVXXXXTPLMNXDXILAVXKYXQRITLYLXEXKYSPCXWEVV
RAEIMRSFSFSTNLQKRLRRKE 
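
SEQ ID NO:71 reads as a consensus in which X marks a variable position, and claim 3 below recites the residues permitted at each X. The following is a minimal sketch of how a candidate sequence could be checked against that pattern. It assumes that the consensus ends in WEVV (as the other entries do, which gives 166 residues), that the position indices and residue lists are as recited in claim 3 with evident scanning errors repaired, and it ignores the "conservatively substituted variation" language of the claim. The names `CONSENSUS`, `ALLOWED`, and `motif_violations` are illustrative only.

```python
# Consensus of SEQ ID NO:71; "X" marks a variable position.
# Assumption: the final four residues are WEVV, giving 166 residues in total.
CONSENSUS = (
    "CDLPQTHSLGXXRAXXLLXQMXRXSXFSCLKDRXDFGXPXEEFDXXXF"
    "QXXQAIXXXHEXXQQTFNXFSTKXSSXXWXXXLLXKXXTXLXQQLNXL"
    "EACVXQXVXXXXTPLMNXDXILAVXKYXQRITLYLXEXKYSPCXWEVV"
    "RAEIMRSFSFSTNLQKRLRRKE"
)

# Residues permitted at each variable position (1-based), as read from claim 3.
ALLOWED = {
    11: "ND", 12: "RSK", 15: "LM", 16: "IMV", 19: "AG", 22: "GR", 24: "IT",
    26: "PH", 34: "HYQ", 38: "FL", 40: "QR", 45: "GS", 46: "NH", 47: "QR",
    50: "KR", 51: "AT", 55: "SF", 56: "VA", 57: "LF", 60: "MI", 61: "IM",
    67: "LF", 72: "DN", 75: "AV", 76: "AT", 78: "ED", 79: "QE", 80: "SRTN",
    83: "ED", 85: "FL", 86: "SY", 88: "EG", 90: "YHN", 95: "DEN",
    101: "IMV", 103: "EG", 105: "GW", 106: "VM", 107: "EGK", 108: "EG",
    114: "VEG", 116: "SP", 121: "KR", 124: "FL", 132: "TIM", 134: "KR",
    140: "AS",
}


def motif_violations(seq):
    """Return (position, residue) pairs where seq departs from the pattern."""
    if len(seq) != len(CONSENSUS):
        raise ValueError("expected %d residues, got %d" % (len(CONSENSUS), len(seq)))
    bad = []
    for pos, (cons, res) in enumerate(zip(CONSENSUS, seq), start=1):
        ok = res in ALLOWED[pos] if cons == "X" else res == cons
        if not ok:
            bad.append((pos, res))
    return bad


# SEQ ID NO:36 (2DH12) from the listing, with OCR spaces removed.
SEQ_ID_NO_36 = (
    "CDLPQTHSLGNRRALMLLAQMGRISPFSCLKDRQDFGFPQEEFDGNQF"
    "QKAQAISVLHEMIQQTFNLFSTKDSSAAWEQTLLEKFSTELYQQLNDL"
    "EACVIQEVGVKETPLMNVDSILAVRKYFQRITLYLIERKYSPCAWEVV"
    "RAEIMRSFSFSTNLQKRLRRKE"
)

print(motif_violations(SEQ_ID_NO_36))  # → []
```

Returning the list of offending positions, rather than a bare yes or no, makes it easier to see whether a mismatch reflects a real substitution or a transcription error in the listing.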


SEQ ID NO:72 


CH1.1 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGTCTGATGGAC 
AGACATGACTTTGGATTTCCCCAGGAGGAGTTTGATGACAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAACAGACC 
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTACTTGGGATGAGACA 
CTTCTAGACAAATTCTACACTGAACTTTACCAGCAGCTGAATGACCTG
GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGAGGACTCCATCTTGGCTGTGAAGAAATACTTCCGAAGAATCACT 
CTCTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO: 73 


CH1.2 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC
CAGAAGGCTCAAGGCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCCATCTCTTCAGCACAAAGGACTCATCTGCTACTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTAACCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAAGAAATACTTCCGAAGAATCACT 
CTTTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:74 


CH1.3 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGACTTTGATG
ATAATGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTTCCTCAGGAGGAGTTTGATGGCAACCAGTTC
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTACTTGGGATGAGACA 
CTTCTAGACAAATTCTACACTGAACTTTACCAGCAGCTGAATGACCTG 
GAAGCCTGTATGATGCAGGAGGTTGGAGTGGAAGACACTCCTCTGATG 
AATGTGGACTCTATCCTGACTGTGAGAAAATACTTTCGAAGAATCACT 
CTTTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:75 


CH1.4


TGTGATCTGCCTCAGACCCACAGCCTGGGTAATAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTCCCCCAGGAGGAGTTTGGTGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAGAGGACTCATCTGCTGCTTGGGATGAGACC 
CTCCTAGACAAATTCTACATTGAACTTTTCCAGCAACTGAATGACCTG 
GAAGCCTGTGTGATGCAGGAGGAGAGGGTGGGAGAAACTCCCCTGATG 
AATGCGGACTCCATCTTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO: 76 


CH2.1 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGACTTTGATG 
ATAATGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTTCCTCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTACTTGGGATGAGACA 
CTTCTAGACAAATTCTACACTGAACTTTACCAGCAGCTGAATGACCTG 
GAAGCCTGTATGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGAGGACTCCATCTTGGCTGTGAAGAAATACTTCCGAAGAATCACT 
CTCTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO: 77


CH2.2 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGTCTGATGGAC 
AGACATGACTTTGGATTTCCCCAGGAGGAGTTTGATGACAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAACAGACC 
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTACTTGGGATGAGACA 
CTTCTAGACAAATTCTACACTGAACTTTACCAGCAGCTGAATGACCTG 
GAAGCCTGTATGATGCAGGAGGTTGGAGTGGAAGACACTCCTCTGATG 
AATGTGGACTCTATCCTGACTGTGAAGAAATACTTCCGAAGAATCACT 
CTTTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:78 


CH2.3


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGACTTTGATG 
ATAATGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTTCCTCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTACTTGGGATGAGACA 
CTTCTAGACAAATTCTACACTGAACTTTACCAGCAGCTGAATGACCTG 
GAAGCCTGTATGATGCAGGAGGTTGGAGTGGAAGACACTCCTCTGATG
AATGAGGACTCCATCTTGGCTGTGAAGAAATACTTCCGAAGAATCACT
CTCTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTCTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:79 


CH1.1 


CDLPQTHSLGNRRALILLAQMGRISPFSCLMDRHDFGFPQEEFDDNQF 
QKAQAISVLHEMIQQTFNLFSTKDSSATWDETLLDKFYTELYQQLNDL 
EACVIQEVGVEETPLMNEDSILAVKKYFRRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:80 


CH1.2


CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRHDFGFPQEEFDGNQF 
QKAQGISVLHEMIQQTFHLFSTKDSSATWEQSLLEKFSTELNQQLNDL 
EACVIQEVGVEETPLMNVDSILAVKKYFRRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE


SEQ ID NO:81


CH1.3


CDLPQTHSLGNRRTLMIMAQMGRISPFSCLKDRHDFGFPQEEFDGNQF
QKAQAISVLHEMIQQTFNLFSTKDSSATWDETLLDKFYTELYQQLNDL
EACMMQEVGVEDTPLMNVDSILTVRKYFRRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE


SEQ ID NO: 82 


CH1.4


CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRHDFGFPQEEFGGNQF 
QKAQAISVLHEMIQQTFNLFSTEDSSAAWDETLLDKFYIELFQQLNDL
EACVMQEERVGETPLMNADSILAVKKYFQRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE


SEQ ID NO:83 


CH2.1 


CDLPQTHSLGNRRTLMIMAQMGRISPFSCLKDRHDFGFPQEEFDGNQF
QKAQAISVLHEMIQQTFNLFSTKDSSATWDETLLDKFYTELYQQLNDL
EACMIQEVGVEETPLMNEDSILAVKKYFRRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE


SEQ ID NO: 84 


CH2.2 


CDLPQTHSLGNRRALILLAQMGRISPFSCLMDRHDFGFPQEEFDDNQF 
QKAQAISVLHEMIQQTFNLFSTKDSSATWDETLLDKFYTELYQQLNDL
EACMMQEVGVEETPLMNVDSILTVKKYFRRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO: 85 


CH2.3 


CDLPQTHSLGNRRTLMIMAQMGRISPFSCLKDRHDFGFPQEEFDGNQF
QKAQAISVLHEMIQQTFNLFSTKDSSATWDETLLDKFYTELYQQLNDL
EACMMQEVGVEETPLMNEDSILAVKKYFRRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE


WHAT IS CLAIMED IS:

1. An isolated or recombinant nucleic acid, comprising: 
a polynucleotide sequence selected from the group consisting of: 

(a) SEQ ID NO:1 to SEQ ID NO:35, or a complementary polynucleotide sequence
thereof;

(b) a polynucleotide sequence encoding a polypeptide selected from SEQ ID 
NO:36 to SEQ ID NO:70, or a complementary polynucleotide sequence thereof; 

(c) a polynucleotide sequence which hybridizes under highly stringent conditions 
over substantially the entire length of polynucleotide sequence (a) or (b); and 

(d) a polynucleotide sequence comprising a fragment of (a), (b), or (c), which
fragment encodes a polypeptide having antiproliferative activity in a human Daudi cell
line-based assay.

2. An isolated or recombinant nucleic acid, comprising: 
a polynucleotide sequence selected from the group consisting of: 

(a) SEQ ID NO:72 to SEQ ID NO:78, or a complementary polynucleotide
sequence thereof;

(b) a polynucleotide sequence encoding a polypeptide selected from SEQ ID 
NO:79 to SEQ ID NO:85, or a complementary polynucleotide sequence thereof; 

(c) a polynucleotide sequence which hybridizes under highly stringent conditions
over substantially the entire length of polynucleotide sequence (a) or (b); and

(d) a polynucleotide sequence comprising a fragment of (a), (b) or (c), which
fragment encodes a polypeptide having antiviral activity in a murine cell line/EMCV-based
assay.

3. An isolated or recombinant nucleic acid, comprising: 

a polynucleotide sequence encoding a polypeptide, the polypeptide comprising the
amino acid sequence: CDLPQTHSLG-X11-X12-RA-X15-X16-LL-X19-QM-X22-R-X24-S-
X26-FSCLKDR-X34-DFG-X38-P-X40-EEFD-X45-X46-X47-FQ-X50-X51-QAI-X55-X56-X57-
HE-X60-X61-QQTFN-X67-FSTK-X72-SS-X75-X76-W-X78-X79-X80-LL-X83-K-X85-X86-T-
X88-L-X90-QQLN-X95-LEACV-X101-Q-X103-V-X105-X106-X107-X108-TPLMN-X114-D-X116-
ILAV-X121-KY-X124-QRITLYL-X132-E-X134-KYSPC-X140-
WEVVRAEIMRSFSFSTNLQKRLRRKE, or a conservatively substituted variation
thereof, where X11 is N or D; X12 is R, S, or K; X15 is L or M; X16 is I, M, or V; X19 is A or
G; X22 is G or R; X24 is I or T; X26 is P or H; X34 is H, Y or Q; X38 is F or L; X40 is Q or R;
X45 is G or S; X46 is N or H; X47 is Q or R; X50 is K or R; X51 is A or T; X55 is S or F; X56
is V or A; X57 is L or F; X60 is M or I; X61 is I or M; X67 is L or F; X72 is D or N; X75 is A
or V; X76 is A or T; X78 is E or D; X79 is Q or E; X80 is S, R, T, or N; X83 is E or D; X85 is
F or L; X86 is S or Y; X88 is E or G; X90 is Y, H, or N; X95 is D, E, or N; X101 is I, M, or V;
X103 is E or G; X105 is G or W; X106 is V or M; X107 is E, G, or K; X108 is E or G; X114 is V,
E, or G; X116 is S or P; X121 is K or R; X124 is F or L; X132 is T, I, or M; X134 is K or R; and
X140 is A or S.

4. The nucleic acid of claim 3, said polypeptide having antiproliferative
activity in a human Daudi cell line-based cell proliferation assay or antiviral activity in a
human WISH cell/EMCV-based assay.

5. The nucleic acid of claim 3, wherein the encoded polypeptide has an
antiproliferative activity of at least about 8.3x10^6 units/milligram in a human Daudi cell
line-based assay or an antiviral activity of at least about 2.1x10^7 units/milligram in a
human WISH cell/EMCV-based assay.

6. The nucleic acid of claim 3, wherein the encoded polypeptide
comprises an amino acid sequence selected from the group consisting of: SEQ ID NO:36
to SEQ ID NO:54.

7. The nucleic acid of claim 3, said nucleic acid comprising a
polynucleotide sequence selected from the group consisting of: SEQ ID NO:1 to SEQ ID
NO:19.

8. An isolated or recombinant nucleic acid comprising a polynucleotide
sequence encoding a polypeptide, the polypeptide comprising:

an amino acid sequence comprising at least 20 contiguous amino acids of any one
of SEQ ID NOS:36-70, and one or more of amino acids Ala19, (Tyr or Gln)34, Gly37,
Phe38, Lys71, Ala76, Tyr90, Ile132, Arg134, Phe152, Lys160, and Glu166, wherein the
numbering of the amino acids corresponds to that of SEQ ID NO:36.

9. The nucleic acid of claim 8, wherein the encoded polypeptide is 166
amino acids in length.

10. The nucleic acid of claim 8, wherein the encoded polypeptide has an
antiproliferative activity in a human Daudi cell line-based assay.

11. The nucleic acid of claim 8, wherein the encoded polypeptide has an
antiviral activity in a human WISH cell/EMCV-based assay.

12. The nucleic acid of claim 8, wherein the encoded polypeptide
comprises amino acids Ala19, (Tyr or Gln)34, Gly37, Phe38, Lys71, Ala76, Tyr90,
Ile132, Arg134, Phe152, Lys160, and Glu166.

13. The nucleic acid of claim 8, wherein the encoded polypeptide
comprises at least 50 contiguous amino acid residues of any one of SEQ ID NOS:36-70.

14. The nucleic acid of claim 8, wherein the encoded polypeptide
comprises at least 100 contiguous amino acid residues of any one of SEQ ID NOS:36-70.

15. The nucleic acid of claim 8, wherein the encoded polypeptide
comprises at least 150 contiguous amino acid residues of any one of SEQ ID NOS:36-70.
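
The two kinds of criteria recited in claims 8 and 12 to 15 (a minimum run of contiguous residues shared with a reference sequence, and particular residues at numbered positions) could be checked along the following lines. This is a minimal sketch: the function names are hypothetical, and it assumes the candidate is numbered identically to SEQ ID NO:36, that is, aligned to it without gaps.

```python
# Residues recited in claim 8, keyed by 1-based position (SEQ ID NO:36 numbering).
MARKER_RESIDUES = {
    19: "A", 34: "YQ", 37: "G", 38: "F", 71: "K", 76: "A",
    90: "Y", 132: "I", 134: "R", 152: "F", 160: "K", 166: "E",
}


def longest_shared_run(candidate, reference):
    """Length of the longest stretch of contiguous residues common to both."""
    best = 0
    prev = [0] * (len(reference) + 1)
    for i in range(1, len(candidate) + 1):
        cur = [0] * (len(reference) + 1)
        for j in range(1, len(reference) + 1):
            if candidate[i - 1] == reference[j - 1]:
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


def marker_positions_present(candidate):
    """Positions at which the candidate carries one of the recited residues.

    Assumes the candidate uses the same numbering as the reference (no gaps).
    """
    return [
        pos for pos, allowed in sorted(MARKER_RESIDUES.items())
        if len(candidate) >= pos and candidate[pos - 1] in allowed
    ]
```

A candidate would satisfy the claim 8 criteria in this sketch when `longest_shared_run` returns at least 20 for some reference sequence and `marker_positions_present` is non-empty.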

16. The nucleic acid of claim 8, wherein the encoded polypeptide
comprises an amino acid sequence selected from the group consisting of: SEQ ID NO:36,
SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ
ID NO:45, and SEQ ID NO:46.

17. The nucleic acid of claim 8, comprising a polynucleotide sequence
selected from the group consisting of: SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:4, SEQ
ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:10, and SEQ ID NO:11.

18. An isolated or recombinant nucleic acid comprising a polynucleotide
sequence encoding a polypeptide, the polypeptide comprising:

an amino acid sequence comprising at least 155 contiguous amino acids of any
one of SEQ ID NOS:36-70, said amino acid sequence comprising amino acids Lys160
and Glu166, wherein the numbering of the amino acids corresponds to that of SEQ ID
NO:36.

19. The nucleic acid of claim 18, wherein the encoded polypeptide
comprises an amino acid sequence selected from the group consisting of: SEQ ID NO:36,
SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ
ID NO:45, and SEQ ID NO:46.

20. A cell comprising the nucleic acid of claim 1, 2, 8, or 18. 

21. The cell of claim 20, wherein the cell expresses a polypeptide encoded 
by the nucleic acid. 

22. A vector comprising the nucleic acid of claim 1, 2, 8, or 18.

23. The vector of claim 20, wherein the vector comprises a plasmid, a 
cosmid, a phage, or a virus. 

24. The vector of claim 22, wherein the vector is an expression vector. 

25. A cell transduced by the vector of claim 22. 

26. A composition comprising the nucleic acid of claim 1, 2, 8, or 18, and
an excipient.

27. The composition of claim 26, wherein the excipient is a 
pharmaceutically acceptable excipient. 

28. A composition produced by digesting one or more nucleic acids of
claim 1, 2, 3, 8, or 18 with a restriction endonuclease, an RNAse, or a DNAse.

29. A composition produced by a process comprising incubating one or
more nucleic acids of claim 1, 2, 3, 8, or 18 in the presence of deoxyribonucleotide
triphosphates and a nucleic acid polymerase.

30. The composition of claim 29, wherein the nucleic acid polymerase is a
thermostable polymerase.

31. An isolated or recombinant polypeptide encoded by the nucleic acid
of claim 1, 2, 3, 8, or 18.

32. The isolated or recombinant polypeptide of claim 31, comprising a
sequence selected from the group consisting of: SEQ ID NO:36 to SEQ ID NO:70 or SEQ
ID NO:79 to SEQ ID NO:85.

33. The polypeptide of claim 31, having an antiproliferative activity of at
least about 8.3x10^6 units/milligram (mg) in a human Daudi cell line-based assay or an
antiviral activity of at least about 2.1x10^7 units/milligram in a human WISH
cell/EMCV-based assay.

34. An isolated or recombinant polypeptide, comprising:

the amino acid sequence: CDLPQTHSLG-X11-X12-RA-X15-X16-[...]
R-X24-S-X26-FSCLKDR-X34-DFG-X38-P-[...]
X56-X57-HE-X60-X61-QQTFN-X67-FST[...]
X86-T-X88-L-X90-QQLN[...]
D-X116-ILAV-X121-KY-X124-QRITLYL-X132-E-X134-KYSPC-X140-
WEVVRAEIMRSFSFSTNLQKRLRRKE, or a conservatively substituted variation
thereof;

wherein X11 is N or D; X12 is R, S, or K; X15 is L or M; X16 is I, M, or V; X19 is A
or G; X22 is G or R; X24 is I or T; X26 is P or H; X34 is H, Y, or Q; X38 is F or L; X40 is Q or
R; X45 is G or S; X46 is N or H; X47 is Q or R; X50 is K or R; X51 is A or T; X55 is S or F;
X56 is V or A; X57 is L or F; X60 is M or I; X61 is I or M; X67 is L or F; X72 is D or N; X75
is A or V; X76 is A or T; X78 is E or D; X79 is Q or E; X80 is S, R, T, or N; X83 is E or D;
X85 is F or L; X86 is S or Y; X88 is E or G; X90 is Y, H, or N; X95 is D, E, or N; X101 is I, M,
or V; X103 is E or G; X105 is G or W; X106 is V or M; X107 is E, G, or K; X108 is E or G;
X114 is V, E, or G; X116 is S or P; X121 is K or R; X124 is F or L; X132 is T, I, or M; X134 is K
or R; and X140 is A or S.

35. The polypeptide of claim 34, having antiproliferative activity of at
least about 8.3x10^6 units/milligram in a human Daudi cell line-based assay or antiviral
activity of at least about 2.1x10^7 units/milligram in a human WISH cell/EMCV-based
assay.




WO 01/25438 



PCT/US00/27781 



36. The polypeptide of claim 34, comprising a sequence selected from the 
group consisting of: SEQ ID NO:36 to SEQ ID NO:54. 

37. A polypeptide comprising at least 100 contiguous amino acids of a 
protein encoded by a coding polynucleotide sequence, the polynucleotide sequence 
selected from the group consisting of: 

(a) SEQ ID NO:1 to SEQ ID NO:35 or SEQ ID NO:72 to SEQ ID NO:78;

(b) a coding polynucleotide sequence that encodes a first polypeptide selected 
from SEQ ID NO:36 to SEQ ID NO:70 or SEQ ID NO:79 to SEQ ID NO:85; 
and 

(c) a complementary polynucleotide sequence which hybridizes under highly 
stringent conditions over substantially an entire length of a polynucleotide 
sequence of (a) or (b). 

38. The polypeptide of claim 37, said polypeptide having an
antiproliferative activity in a human Daudi cell line-based cell proliferation assay or an
antiviral activity in a human WISH cell/EMCV-based assay.

39. The polypeptide of claim 37, wherein the polypeptide specifically 
binds to a human alpha-interferon receptor. 

40. The polypeptide of claim 37, comprising at least 150 contiguous 
amino acids of the encoded protein. 

41. An isolated or recombinant polypeptide, comprising: 

an amino acid sequence comprising at least 50 contiguous amino acids of any one of SEQ
ID NOS:36-70, the amino acid sequence comprising one or more of amino acids Ala19,
(Tyr or Gln)34, Gly37, Phe38, Lys71, Ala76, Tyr90, Ile132, Arg134, Phe152, Lys160, and
Glu166, wherein the numbering of the amino acids corresponds to that of SEQ ID NO:36.

42. The polypeptide of claim 41, wherein the polypeptide binds a human 
alpha-interferon receptor.

43. The polypeptide of claim 41, said polypeptide exhibiting an
antiproliferative activity in a human Daudi cell line-based cell proliferation assay or an 
antiviral activity in a human WISH cell/EMCV-based assay. 

44. The polypeptide of claim 41, having an antiproliferative activity of at
least about 8.3x10^6 units/milligram in a human Daudi cell line-based assay or an antiviral
activity of at least about 2.1x10^7 units/milligram in a human WISH cell/EMCV-based
assay.

45. The polypeptide of claim 41, wherein the polypeptide is 166 amino
acids in length.

46. The polypeptide of claim 41, said polypeptide comprising amino acids
Ala19, (Tyr or Gln)34, Gly37, Phe38, Lys71, Ala76, Tyr90, Ile132, Arg134, Phe152,
Lys160, and Glu166, wherein the numbering of the amino acids of said polypeptide
corresponds to the numbering of amino acids in SEQ ID NO:36.

47. The polypeptide of claim 41, comprising at least 100 contiguous
amino acid residues of any one of SEQ ID NOS:36-70.

48. The polypeptide of claim 41, comprising at least 150 contiguous 
amino acid residues of any one of SEQ ID NOS:36-70. 

49. The polypeptide of claim 41, comprising at least 155 contiguous
amino acid residues of any one of SEQ ID NOS:36-70.

50. The polypeptide of claim 41, comprising an amino acid sequence
selected from the group consisting of: SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:39,
SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:45, and SEQ ID NO:46.

51. An isolated or recombinant polypeptide comprising an amino acid
sequence comprising at least 155 contiguous amino acids of any one of SEQ ID NOS:36-70,
the isolated or recombinant polypeptide comprising amino acids Lys160 and Glu166,
wherein the numbering of the amino acids corresponds to that of SEQ ID NO:36.

52. The polypeptide of claim 51, comprising an amino acid sequence
selected from the group consisting of: SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:39,
SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:45, and SEQ ID NO:46.

53. The polypeptide of claim 51, said polypeptide having an
antiproliferative activity of at least about 8.3x10^6 units/milligram in a human
Daudi cell line-based assay or an antiviral activity of at least about 2.1x10^7
units/milligram in a human WISH cell/EMCV-based assay.

54. The polypeptide of claim 31, 34, 37, 41, or 51, further comprising a 
secretion/localization sequence. 

55. The polypeptide of claim 31, 34, 37, 41, or 51, further comprising a
polypeptide purification subsequence.

56. The polypeptide of claim 55, wherein the sequence that facilitates 
purification is selected from the group consisting of: an epitope tag, a FLAG tag, a 
polyhistidine tag, and a GST fusion. 

57. The polypeptide of claim 31, 34, 37, 41, or 51, further comprising a
Met at the N-terminus.

58. The polypeptide of claim 31, 34, 37, 41, or 51, comprising a modified
amino acid.

59. The polypeptide of claim 58, wherein the modified amino acid is
selected from the group consisting of: a glycosylated amino acid, a PEGylated amino acid,
a farnesylated amino acid, an acetylated amino acid, and a biotinylated amino acid.

60. A composition comprising the polypeptide of claim 31, 34, 37, 41, or
51 and an excipient.

61. The composition of claim 60, wherein the excipient is a
pharmaceutically acceptable excipient.

62. A composition comprising the polypeptide of claim 58 in a
pharmaceutically acceptable excipient.

63. A polypeptide which is specifically bound by a polyclonal antisera
raised against at least one antigen, said at least one antigen comprising at least one amino
acid sequence of SEQ ID NO:36 to SEQ ID NO:70 or SEQ ID NO:79 to SEQ ID NO:85,
or a fragment thereof, wherein the antisera is subtracted with an IFN-alpha polypeptide
encoded by a nucleic acid corresponding to one or more of GenBank accession number:
J00210 (alpha-D), J00207 (Alpha-A), X02958 (Alpha-6), X02956 (Alpha-5), V00533
(alpha-H), V00542 (alpha-14), V00545 (IFN-1B), X03125 (alpha-8), X02957 (alpha-16),
V00540 (alpha-21), X02955 (alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961
(alpha-10 pseudogene), R0067 (Gx-1), 101614, 101787, 107821, M12350 (alpha-F),
M38289, V00549 (alpha-2a), and 108313 (alpha-Con1).

64. An antibody or antisera produced by administering the polypeptide of
claim 31, 34, 37, 41, or 51 to a mammal, which antibody or antisera specifically binds at
least one antigen, said at least one antigen comprising a polypeptide comprising one or
more of the amino acid sequences of SEQ ID NO:36 to SEQ ID NO:70 and SEQ ID
NO:79 to SEQ ID NO:85, or a fragment thereof, which antibody or antisera does not
specifically bind to an IFN-alpha polypeptide encoded by a nucleic acid corresponding to one
or more of GenBank accession number: J00210 (alpha-D), J00207 (Alpha-A), X02958
(Alpha-6), X02956 (Alpha-5), V00533 (alpha-H), V00542 (alpha-14), V00545 (IFN-1B),
X03125 (alpha-8), X02957 (alpha-16), V00540 (alpha-21), X02955 (alpha-4b), V00532
(alpha-C), X02960 (alpha-7), X02961 (alpha-10 pseudogene), R0067 (Gx-1), 101614,
101787, 107821, M12350 (alpha-F), M38289, V00549 (alpha-2a), and 108313
(alpha-Con1).

65. An antibody or antisera which specifically binds a polypeptide, the
polypeptide comprising a sequence selected from the group consisting of: SEQ ID NO:36
to SEQ ID NO:70 or SEQ ID NO:79 to SEQ ID NO:85, wherein the antibody or antisera
does not specifically bind to an IFN-alpha polypeptide encoded by a nucleic acid
corresponding to one or more of GenBank accession number: J00210 (alpha-D), J00207
(Alpha-A), X02958 (Alpha-6), X02956 (Alpha-5), V00533 (alpha-H), V00542 (alpha-14),
V00545 (IFN-1B), X03125 (alpha-8), X02957 (alpha-16), V00540 (alpha-21), X02955
(alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961 (alpha-10 pseudogene), R0067
(Gx-1), 101614, 101787, 107821, M12350 (alpha-F), M38289, V00549 (alpha-2a), and
108313 (alpha-Con1).

66. A method of producing a polypeptide, the method comprising: 
introducing into a population of cells a nucleic acid of claim 1, 2, 3, 8, or 18, the 

nucleic acid operatively linked to a regulatory sequence effective to produce the encoded 
polypeptide; and 

culturing the cells in a culture medium to produce the polypeptide. 

67. A method of producing a polypeptide, the method comprising: 
introducing into a population of cells a recombinant expression vector comprising 

the nucleic acid of claim 1, 2, 3, 8, or 18; and 

culturing the cells in a culture medium under conditions suitable to produce the 
polypeptide encoded by the expression vector. 

68. A method of inhibiting growth of a population of tumor cells, the
method comprising: 

contacting the population of tumor cells with an effective amount of a polypeptide 
of claim 31, 34, 37, 41, or 51 sufficient to inhibit growth of tumor cells in said population 
of tumor cells, thereby inhibiting growth of tumor cells in said population of cells. 

69. The method of claim 68, wherein the tumor cells are selected from the
group consisting of: human carcinoma cells, human leukemia cells, human T-lymphoma
cells, and human melanoma cells.

70. The method of claim 68, wherein the tumor cells are in culture. 

71. A method of inhibiting the replication of a virus within at least one 
cell infected by the virus, the method comprising: 

contacting said at least one infected cell with an effective amount of a polypeptide
of claim 31, 34, 37, 41, or 51 sufficient to inhibit viral replication in said at least one
infected cell, thereby inhibiting replication of the virus in said at least one infected cell.

72. The method of claim 71, wherein the virus is an RNA virus.

73. The method of claim 72, wherein the virus is a human 
immunodeficiency virus or a hepatitis C virus. 

74. The method of claim 71, wherein the virus is a DNA virus. 

75. The method of claim 74, wherein the virus is a hepatitis B virus.

76. The method of claim 71, wherein the cells are cultured. 

77. A method of treating an autoimmune disorder in a patient, the method
comprising: administering to the patient an effective amount of the polypeptide of claim
31, 34, 37, 41, or 51.

78. The method of claim 77, wherein the autoimmune disorder is selected
from the group consisting of multiple sclerosis, rheumatoid arthritis, lupus erythematosus,
and type I diabetes.

79. In a method of treating a disorder treatable by administration of
interferon-alpha to a subject, an improved method comprising: administering to the subject
an effective amount of the polypeptide of claim 31, 34, 37, 41, or 51.

80. The method of claim 79, wherein the disorder treatable by administration
of interferon-alpha is selected from the group consisting of: sclerosis, rheumatoid arthritis,
lupus erythematosus, and type I diabetes.

81. A method for making a modified or recombinant nucleic acid, the
method comprising:

recursively recombining a sequence of one or more nucleic acids of claim 1, 2, 3,
8, or 18 with a sequence of one or more additional nucleic acids, each sequence of the one
or more additional nucleic acids encoding an interferon-alpha or an amino acid
subsequence thereof.

82. The method of claim 81, wherein said recursive recombination
produces at least one library of recombinant interferon-alpha homologue nucleic acids.

83. A nucleic acid library produced by the method of claim 82.

84. A population of cells comprising the library of claim 83. 

85. A recombinant interferon-alpha homologue nucleic acid produced by 
the method of claim 82. 

86. A cell comprising the nucleic acid of claim 85.

87. The method of claim 81, wherein the recursive recombination is 
performed in vitro. 

88. The method of claim 81, wherein the recursive recombination is 
performed in vivo or ex vivo. 

89. A composition comprising two or more nucleic acids of claim 1, 2, 3,
8, or 18.

90. The composition of claim 89, wherein the composition comprises a 
library comprising at least ten nucleic acids. 

91. A method of producing a modified or recombinant interferon-alpha
homologue nucleic acid comprising mutating a nucleic acid of claim 1, 2, 3, 8, or 18.

92. The modified or recombinant interferon-alpha homologue nucleic 
acid produced by the method of claim 91. 

93. A computer or computer readable medium comprising a database
comprising a sequence record comprising one or more character strings corresponding to a
nucleic acid or protein sequence selected from SEQ ID NO:1 to SEQ ID NO:85.

94. An integrated system comprising a computer or computer readable
medium comprising a database comprising one or more sequence records, each of said
sequence records comprising one or more character strings corresponding to a nucleic acid
or protein sequence selected from SEQ ID NO:1 to SEQ ID NO:85, the integrated system
further comprising a user input interface allowing a user to selectively view said one or
more sequence records.

95. The integrated system of claim 94, the computer or computer readable
medium comprising an alignment instruction set which aligns the character strings with
one or more additional character strings corresponding to a nucleic acid or protein
sequence.

96. The integrated system of claim 95, wherein the instruction set
comprises one or more of: a local homology comparison determination, a homology
alignment determination, a search for similarity determination, and a BLAST
determination.
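
For orientation, one common way to carry out a local comparison of two character strings is Smith-Waterman style dynamic programming. The sketch below computes only the best local alignment score; the function name and the scoring values (match +2, mismatch -1, gap -2) are illustrative assumptions, not parameters taken from this document.

```python
def local_alignment_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score between two sequences (linear gap penalty).

    Scores are floored at zero, so an alignment can start and end anywhere
    in either sequence.
    """
    best = 0
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        cur = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diagonal = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            cur[j] = max(0, diagonal, prev[j] + gap, cur[j - 1] + gap)
            best = max(best, cur[j])
        prev = cur
    return best
```

Keeping only two rows of the score matrix is enough when the score alone is needed; recovering the alignment itself would require the full matrix and a traceback.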

97. The integrated system of claim 95, further comprising a user readable
output element which displays an alignment produced by the alignment instruction set.

98. The integrated system of claim 94, the computer or computer readable
medium further comprising an instruction set which translates at least one nucleic acid
sequence comprising a sequence selected from SEQ ID NO:1 to SEQ ID NO:35 or SEQ
ID NO:72 to SEQ ID NO:78 into an amino acid sequence.
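
A translation step of the kind recited in claim 98 can be sketched with the standard genetic code. The example below is illustrative only: the names `CODON_TABLE` and `translate` are hypothetical, reading starts at the first base, and translation stops at the first stop codon.

```python
# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(nucleic_acid):
    """Translate a coding sequence (DNA or RNA letters) into one-letter residues.

    Any trailing partial codon is ignored; characters other than A, C, G, T, U
    raise KeyError.
    """
    dna = nucleic_acid.upper().replace("U", "T")
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)
```

For instance, `translate("TGTGATCTGCCTCAGACC")` gives `CDLPQT`, the first six residues of the sequence recited in claim 3.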

99. The integrated system of claim 94, the computer or computer readable
medium further comprising an instruction set for reverse-translating at least one amino
acid sequence comprising a sequence selected from SEQ ID NO:36 to SEQ ID NO:70 or
SEQ ID NO:79 to SEQ ID NO:85 into a nucleic acid sequence.

100. The integrated system of claim 99, wherein the instruction set selects
the nucleic acid sequence by applying a codon usage instruction set or an instruction set
which determines sequence identity to a test nucleic acid sequence.
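
Reverse translation with a codon preference, as in claims 99 and 100, might look like the following sketch. The `preferred` mapping stands in for a codon usage table and is a placeholder supplied by the caller, not real usage data; where no preference is given, the first synonymous codon in table order is used, which is an arbitrary choice made for illustration.

```python
from collections import defaultdict

# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Group synonymous codons by the residue they encode.
SYNONYMOUS = defaultdict(list)
for codon, residue in CODON_TABLE.items():
    SYNONYMOUS[residue].append(codon)


def reverse_translate(protein, preferred=None):
    """Pick one codon per residue, honouring caller-supplied preferences."""
    preferred = preferred or {}
    codons = []
    for residue in protein.upper():
        if residue == "*" or residue not in SYNONYMOUS:
            raise ValueError("unsupported residue: " + residue)
        codon = preferred.get(residue, SYNONYMOUS[residue][0])
        if codon not in SYNONYMOUS[residue]:
            raise ValueError(codon + " does not encode " + residue)
        codons.append(codon)
    return "".join(codons)
```

Because the genetic code is degenerate, many nucleic acid sequences encode the same polypeptide; the preference table is what selects one of them.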

101. A method of using a computer system to present information
pertaining to at least one of a plurality of sequence records stored in a database, said
sequence records each comprising at least one character string corresponding to SEQ ID
NO:1 to SEQ ID NO:85, the method comprising:

determining a list of at least one character string corresponding to one or more of
SEQ ID NO:1 to SEQ ID NO:85 or a subsequence thereof;

determining which of said at least one character string of said list are selected by a
user; and

displaying each of the selected character strings, or aligning each of the selected
character strings with an additional character string.

102. The method of claim 101, further comprising displaying an alignment 
of each of the selected character strings with the additional character string. 

103. The method of claim 101, further comprising displaying the list.

104. A nucleic acid which comprises a unique subsequence in a nucleic
acid selected from SEQ ID NO:1 to SEQ ID NO:35 or SEQ ID NO:72 to SEQ ID NO:78,
wherein the unique subsequence is unique as compared to a nucleic acid sequence of a
known interferon-alpha nucleic acid sequence or a nucleic acid corresponding to any of
GenBank accession number: J00210 (alpha-D), J00207 (Alpha-A), X02958 (Alpha-6),
X02956 (Alpha-5), V00533 (alpha-H), V00542 (alpha-14), V00545 (IFN-1B), X03125
(alpha-8), X02957 (alpha-16), V00540 (alpha-21), X02955 (alpha-4b), V00532 (alpha-C),
X02960 (alpha-7), X02961 (alpha-10 pseudogene), R0067 (Gx-1), 101614, 101787,
107821, M12350 (alpha-F), M38289, V00549 (alpha-2a), and 108313 (alpha-Con1).

105. A polypeptide which comprises a unique subsequence in a
polypeptide selected from: SEQ ID NO:36 to SEQ ID NO:70 or SEQ ID NO:79 to SEQ
ID NO:85, wherein the unique subsequence is unique as compared to a sequence of a
known interferon-alpha polypeptide or a sequence of a polypeptide encoded by a nucleic
acid corresponding to any of GenBank accession number: J00210 (alpha-D), J00207
(Alpha-A), X02958 (Alpha-6), X02956 (Alpha-5), V00533 (alpha-H), V00542 (alpha-14),
V00545 (IFN-1B), X03125 (alpha-8), X02957 (alpha-16), V00540 (alpha-21), X02955
(alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961 (alpha-10 pseudogene), R0067
(Gx-1), 101614, 101787, 107821, M12350 (alpha-F), M38289, V00549 (alpha-2a), and
108313 (alpha-Con1).
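
One simple way to look for subsequences that are unique relative to a set of known sequences, in the sense of claims 104 and 105, is to enumerate fixed-length windows of the query and discard any window that occurs in a reference. The sketch below assumes exact string matching and a caller-chosen window length `k`; both the function name and that choice of criterion are illustrative assumptions rather than a definition taken from this document.

```python
def unique_subsequences(query, references, k):
    """Return (offset, window) pairs for length-k windows of the query
    that occur in none of the reference sequences.

    Offsets are 0-based. Works for nucleic acid or amino acid strings.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    seen = set()
    for ref in references:
        for i in range(len(ref) - k + 1):
            seen.add(ref[i:i + k])
    return [
        (i, query[i:i + k])
        for i in range(len(query) - k + 1)
        if query[i:i + k] not in seen
    ]
```

Shorter windows are more likely to recur in a reference by chance, so the useful value of `k` depends on how closely the reference set resembles the query.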

106. A target nucleic acid which hybridizes under stringent conditions to a
unique coding oligonucleotide which encodes a unique subsequence in a polypeptide
selected from: SEQ ID NO:36 to SEQ ID NO:70 or SEQ ID NO:79 to SEQ ID NO:85,
wherein the unique subsequence is unique as compared to a sequence of a known
interferon-alpha polypeptide or a sequence of a polypeptide encoded by a nucleic acid
corresponding to any of GenBank accession number: J00210 (alpha-D), J00207
(Alpha-A), X02958 (Alpha-6), X02956 (Alpha-5), V00533 (alpha-H), V00542 (alpha-14),
V00545 (IFN-1B), X03125 (alpha-8), X02957 (alpha-16), V00540 (alpha-21), X02955
(alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961 (alpha-10 pseudogene), R0067
(Gx-1), 101614, 101787, 107821, M12350 (alpha-F), M38289, V00549 (alpha-2a), and
108313 (alpha-Con1).

107. The nucleic acid of claim 106, wherein the stringent conditions are
selected such that a perfectly complementary oligonucleotide to the unique coding
oligonucleotide hybridizes to the unique coding oligonucleotide with at least a 5x higher
signal to noise ratio than for hybridization of the perfectly complementary oligonucleotide
to a control nucleic acid corresponding to any of GenBank accession number: J00210
(alpha-D), J00207 (Alpha-A), X02958 (Alpha-6), X02956 (Alpha-5), V00533 (alpha-H),
V00542 (alpha-14), V00545 (IFN-1B), X03125 (alpha-8), X02957 (alpha-16), V00540
(alpha-21), X02955 (alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961 (alpha-10
pseudogene), R0067 (Gx-1), 101614, 101787, 107821, M12350 (alpha-F), M38289, V00549
(alpha-2a), and 108313 (alpha-Con1), wherein the target nucleic acid hybridizes to the
unique coding oligonucleotide with at least a 2x higher signal to noise ratio as compared to
hybridization of the control nucleic acid to the coding oligonucleotide.

108. The nucleic acid of any of claims 1, 2, 3, 8, or 18, wherein the nucleic
acid encodes an interferon-alpha homologue having an increased growth inhibition
activity against a population of cancer cells relative to a growth inhibition activity of
human interferon-alpha 2a against said population of cancer cells.

109. The nucleic acid of claim 108, wherein the cancer cells of said
population of cancer cells comprise a cancer cell line selected from: a leukemia cell line, a
melanoma cell line, a lung cancer cell line, a colon cancer cell line, a central nervous
system (CNS) cancer cell line, an ovarian cancer cell line, a breast cancer cell line, a
prostate cancer cell line, and a renal cancer cell line, and the growth inhibition activity is
measured as a concentration of interferon-alpha homologue producing a 50% inhibition of
growth of the cancer cell line (GI50 value), wherein the interferon-alpha homologue has a
GI50 value at least 2-fold lower than the GI50 value of the human interferon-alpha 2a.






110. The nucleic acid of claim 109, wherein the encoded interferon-alpha
homologue has a GI50 value at least 5-fold lower than the GI50 value of the human
interferon-alpha 2a.

111. The nucleic acid of claim 107, wherein the encoded interferon-alpha
homologue has a GI50 value at least 10-fold lower than the GI50 value of the human
interferon-alpha 2a.
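
The fold comparisons recited for GI50 values (at least 2-fold, 5-fold, or 10-fold lower than the value for human interferon-alpha 2a) reduce to a ratio of two concentrations measured in the same units. The sketch below is a minimal illustration with hypothetical function names; the numbers in the usage note are made up and are not data from this document.

```python
def fold_lower(reference_value, homologue_value):
    """How many times lower the homologue's value is than the reference's.

    Both values must be positive and expressed in the same concentration units.
    """
    if reference_value <= 0 or homologue_value <= 0:
        raise ValueError("values must be positive")
    return reference_value / homologue_value


def meets_fold_threshold(reference_value, homologue_value, fold):
    """True if the homologue's value is at least `fold`-fold lower."""
    return fold_lower(reference_value, homologue_value) >= fold
```

With a reference GI50 of 10.0 and a homologue GI50 of 4.0 (arbitrary units), the ratio is 2.5, which satisfies a 2-fold threshold but not a 5-fold one. The same ratio applies to the TGI and LC50 comparisons in the later claims.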

112. The nucleic acid of any of claims 1, 2, 3, 8, or 18, wherein the nucleic
acid encodes an interferon-alpha homologue having an increased cytostatic activity against
a population of cancer cells relative to the cytostatic activity of human interferon-alpha 2a
against said population of cancer cells.

113. The nucleic acid of claim 112, wherein the cancer cells comprise a
cancer cell line selected from: a leukemia cell line, a melanoma cell line, a lung cancer cell
line, a colon cancer cell line, a CNS cancer cell line, an ovarian cancer cell line, a breast
cancer cell line, a prostate cancer cell line, and a renal cancer cell line, the cytostatic
activity measured as the concentration of an interferon-alpha causing a total inhibition of
growth of the cell line (TGI value), wherein the interferon-alpha homologue has a TGI
value at least 2-fold lower than the TGI value of the human interferon-alpha 2a.

114. The nucleic acid of claim 112, wherein the encoded interferon-alpha
homologue has a TGI value at least 5-fold lower than the TGI value of the human
interferon-alpha 2a.

115. The nucleic acid of claim 112, wherein the encoded interferon-alpha 
homologue has a TGI value at least 10-fold lower than the TGI value of the human 
interferon-alpha 2a. 

116. The nucleic acid of any of claims 1, 2, 3, 8, or 18, wherein the nucleic
acid encodes an interferon-alpha homologue having an increased cytotoxic activity against
a population of cancer cells relative to the cytotoxic activity of human interferon-alpha 2a
against said population of cancer cells.

117. The nucleic acid of claim 116, wherein the cancer cells comprise a
cancer cell line selected from: a leukemia cell line, a melanoma cell line, a lung cancer cell
line, a colon cancer cell line, a central nervous system (CNS) cancer cell line, an ovarian
cancer cell line, a breast cancer cell line, a prostate cancer cell line, and a renal cancer cell
line, the cytotoxic activity measured as the concentration of interferon-alpha producing a
50% reduction in an amount of cellular protein in a cell line measured after a period of
incubation (LC50 value), wherein the interferon-alpha homologue has a LC50 value at
least 2-fold lower than the LC50 value of the human interferon-alpha 2a.

118. The nucleic acid of claim 116, wherein the encoded interferon-alpha
homologue has a LC50 value at least 5-fold lower than the LC50 value of the human
interferon-alpha 2a.

119. The nucleic acid of claim 116, wherein the encoded interferon-alpha
homologue has a LC50 value at least 10-fold lower than the LC50 value of the human
interferon-alpha 2a.

120. The polypeptide of any of claims 31, 34, 37, 41, or 51, said
polypeptide having an increased growth inhibition activity against a population of cancer
cells relative to the inhibition activity of human interferon-alpha 2a against the population
of cancer cells.

121. The polypeptide of claim 120, wherein the population of cancer cells
comprises a cancer cell line selected from: a leukemia cell line, a melanoma cell line, a
lung cancer cell line, a colon cancer cell line, a CNS cancer cell line, an ovarian cancer
cell line, a breast cancer cell line, a prostate cancer cell line, and a renal cancer cell line,
the growth inhibition activity measured as the concentration of polypeptide or human
interferon-alpha 2a causing a 50% inhibition of growth of the cell line (GI50 value),
wherein the polypeptide has a GI50 value at least 2-fold lower than the GI50 value of the
human interferon-alpha 2a.

122. A nucleic acid produced by the method of claim 81. 

123. An interferon-alpha polypeptide or amino acid subsequence thereof 
produced by the method of claim 81. 
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LLEKFSTELYQQLNELEACVIQEVGVGETPLMNGDSILAV 
LLEKFSTELYQQLNELEACVIQEVGVEETPLMNEDSILAV 
LLEKFSTELYQQLNDLEACVIQEVGVEETPLMNVDSILAV 
LLEKFSTELYQQLNDLEACVIQEVGVEETPLMNEDSILAV 
LLEKFSTELYQQLNDLEACVIQEVGVEETPLMNVDSILAV 
LLEKFSTELYQQLNDLEACVIQEVGVEETPLMNEDSILAV 
LLEKFSTELNQQLNDLEACVIQEVGVEETPLMNVDPILAV 
LLEKFSTELHQQLNELEACWQEVGVEETPLMNEDSILAV 
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LLEKFSTELYQQLNELEACVIQEVGVEETPLMNEDSILAV 
LLEKFSTELNQQLNDLEACVIQEVGVEETPLMNEDSILAV 
LLEKFSTELYQQLNNLEACVIQEVGMEETPLMNVDSILAV 
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LLEKFSTELHQQLNDLEACVIQEVGVEETPLMNEDSILAV 
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LLEKFSTELYQQLNNLEACVIQEVGVEETPLMNEDSILAV 
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LLEKFSTELYQQLNNLEACVIQEVGMEETPLMNEDSILAV 
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LLEKFSTELYQQLlSnSILEACVIQEVGVEETPLMNVDSIIiAV 
LLEKFSTELNQQLNDLEACVIQEVGVEETPLVNVDSILAV 
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LLEKFYIELFQQLNDLEACVIQEVGVEETPLMNVDSILAV 
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LLDKFYTELYQQLNDLEACVIQEVGVEETPLMNEDSILAV 
LLEKFSTELNQQLNDLEACVIQEVGVEETPLMNVDSILAV 
LLDKFYTELYQQLNDLEACMMQEVGVEDTPLMNVDSILTV 
LLDKFYIELFQQLNDLEACVMQEERVGETPLMNADSILAV 
LLDKFYTELYQQLNDLEACMIQEVGVEETPLMNEDSILAV 
LLDKFYTELYQQLNDLEACMMQEVGVEETPLMNVDS I LTV 
LLDKFYTELYQQLNDLEACMMQEVGVEETPLMNEDSILAV 
RKYFQRITLYLIERKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFQRITLYLIERKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFQRITLYLTEKKYSPCSWEVVRAEIMRSFSFSTNLQK
RKYFQRITLYLIERKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFQRITLYLIERKYSPCAWEVVRAEIMRSFSFSTNLQK
RKYFQRITLYLIERKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFQRITLYLIERKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFQRITLYLTEKKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYLQRITLYLTEKKYSPCAWEVVRAEIMRSFSFSTNLQK
RKYFQRITLYLIERKYSPCAWEVVRAEIMRSFSFSTNLQK
RKYFQRITLYLIERKYSPCAWEVVRAEIMRSFSFSTNLQK
RKYFQRITLYLTEKKYSPCAWEVVRAEIMRSFSFSTNLQK
RKYFQRITLYLTEKKYSPCSWEVVRAEIMRSFSFSTNLQK
KKYFQRITLYLTEKKYSPCSWEVVRAEIMRSFSFSTNLQK
KKYFQRITLYLTEKKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFQRITLYLTEKKYSPCSWEVVRAEIMRSFSFSTNLQK
KKYFQRITLYLTEKKYSPCAWEVVRAEIMRSFSFSTNLQK
RKYFQRITLYLIERKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFQRITLYLMEKKYSPCAWEVVRAEIMRSFSFSTNLQK
RKYFQRITLYLTKKKYSPCSWEVVRAEIMRSFSFSTNLQK
RKYFQRITLYLMEKKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFQRITLYLTEKKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFQRITLYLTEKKYSPCAWEVVRAEIMRSFSFSTNLQK
RKYFQRITLYLMEKKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFQRITLYLTEKKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFQRITLYLTEKKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFRRITLYLTEKKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFQRITLYLTERKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFQRITLYLTEKKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFQRITLYLIERKYSPCAWEVVRAEIMRSFSFSTNLQK
RKYFQRITLYLTEKKHSPCSWEVVRAEIMRSFSFSTNLQK
RKYFQRITLYLTEEKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFRRITLYLTEKKYSPCAWEAVRAEIMRSFSFSTNLQK
KKYFQRITLYLTEKKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFQRITLYLTERKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFRRITLYLTEKKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFRRITLYLTEKKYSPCAWEVVRAEIMRSFSFSTNLQK
RKYFRRITLYLTEKKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFQRITLYLTEKKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFRRITLYLTEKKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFRRITLYLTEKKYSPCAWEVVRAEIMRSFSFSTNLQK
KKYFRRITLYLTEKKYSPCAWEVVRAEIMRSFSFSTNLQK
SEQUENCE LISTING

<110> HEINRICHS, VOLKER
CHEN, TEDDY
PATTEN, PHILLIP A

<120> IFN-ALPHA HOMOLOGUES

<130> 02-101510/0140.002

<140>
<141>

<150> 09/415,183
<151> 1999-10-07

<160> 88

<170> PatentIn Ver. 2.0

<210> 1
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2DH12

<400> 1
tgtgatctgc ctcagaccca cagccttggc aacaggaggg ccttgatgct cctggcacaa 60
atgggacgaa tctctccttt ctcctgcctg aaggacagac aagactttgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggaacagacc 240
ctcctagaaa aattttccac tgaactctac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg taggggtgaa agagactccc ctgatgaatg tggactccat cctggctgtg 360
aggaagtact tccaaagaat cactctttat ctaatagaga ggaaatacag cccttgtgca 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 2
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2CA3

<400> 2
tgtgatctgc ctcagaccca cagccttggt gacaggaggg ccatgatact cctggcacaa 60
atgggacgaa tctctccttt ctcctgcctg aaggacagat atgatttcgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggaacagagc 240
ctcctagaaa aattttccac tgaactttac cagcagctga atgaactgga agcatgtgtg 300
atacaggagg ttggggtggg agagactccc ctgatgaatg gggactccat cctggctgtg 360
aagaagtact tccaaagaat cactctttat ctaatagaga ggaaatacag cccttgtgca 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 3
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 4AB9

<400> 3
tgtgatctgc ctcagaccca cagccttggc aacaggaggg ccttgatact cctggcacaa 60
atgggacgaa tctctccttt ctcctgcctg aaggacagac atgactttgg attcccccgg 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atgcagcaga ccttcaatct cttcagcaca aagaactcat ctgctgcttg ggatgagacc 240
ctcctagaaa aattttccac tgaactttac cagcaactga atgaactgga agcatgtgtg 300
atacaggagg ttggggtgga agagactccc ctgatgaatg aggactccat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctgacagaga agaagtatag cccttgttcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 4
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2DA4

<400> 4
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatgct cctggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagac aagactttgg attcccccag 120
gaggagtttg atagcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atgcagcaga ccttcaatct cttcagcaca aaggactcat ctgctgcttg ggatgagacc 240
ctcctagaaa aattttccac tgaactctac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagaccccc ctgatgaatg tggactccat cctggctgtg 360
aggaagtact tccaaagaat cactctttat ctaatagaga ggaaatacag cccttgtgca 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 5
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 3DA11

<400> 5
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttggtact cctggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagat atgatttcgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggatgagacc 240
ctcctagaaa aattttccac tgaactttac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagaccccc ctgatgaatg aggactccat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctaatagaga ggaaatacag cccttgtgca 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 6
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2DB11

<400> 6
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatgct cctggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagat atgatttcgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggatgagacc 240
ctcctagaaa aattttccac tgaactttac cagcagctga atgacttgga agcctgtgtg 300
atacaggagg ttggggtgga agagactccc ctgatgaatg tggactccat cctggctgtg 360
aggaagtact tccaaagaat cactctttat ctaatagaga ggaaatacag cccttgtgca 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 7
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2CA5

<400> 7
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggacgaa tctctccttt ctcctgcctg aaggacagac aagactttgg attcccccag 120
gaggagtttg atggcaaccg gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aagaactcat ctgctgcttg ggaacagagc 240
ctcctagaaa aattttccac tgaactctac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagaccccc ctgatgaatg aggactccat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctaatagaga ggaaatacag cccttgtgca 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



WO 01/25438 PCT/US00/27781

<210> 8
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2G6

<400> 8

tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60 

atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg attcccccag 120 

gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180 

atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctacttg ggaacagagc 240

ctcctagaaa aattttccac tgaacttaac cagcagctga atgacctgga agcctgcgtg 300 

atacaggagg ttggggtgga agagactccc ctgatgaatg tggaccccat cctggctgtg 360 

aagaaatact tccaaagaat cactctctat ctgacagaga agaaatacag cccttgtgcc 420 

tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480 

agattaagga ggaaggaa 498

<210> 9 
<211> 498 
<212> DNA 
<213> Artificial Sequence

<220> 

<223> Description of Artificial Sequence: Synthetic DNA 

<220>

<223> Clone ID 3AH7 

<400> 9 

tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60 

atgcgaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg attcccccag 120

gaggagtttg atagcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180 

atccagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggaacagagc 240 

ctcctagaaa aattttccac tgaacttcac cagcaactga atgaactgga agcatgtgta 300 

gtacaggagg ttggggtgga agagactccc ctgatgaatg aggactccat cctggctgtg 360 

aagaaatacc tccaaagaat cactctttat ctgacagaga agaagtatag cccttgtgca 420

tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480 

agattaagga ggaaggaa 498 

<210> 10
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2G5

<400> 10
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatgct cctggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagac aagactttgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggaacagagc 240
ctcctagaaa aattttccac tgaactctac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagaccccc ctgatgaatg tggactccat cctggctgtg 360
aggaagtact tccaaagaat cactctttat ctaatagaga ggaaatacag cccttgtgca 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 11
<211> 498
<212> DNA
<213> Artificial Sequence



<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2BA8

<400> 11
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccctgatact cctggcacaa 60
atgggacgaa tctctccttt ctcctgcctg aaggacagat atgatttcgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggaacagagc 240
ctcctagaaa aattttccac tgaactttac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagaccccc ctaatgaatg tggactccat cctggctgtg 360
aggaagtact tccaaagaat cactctttat ctaatagaga ggaaatacag cccttgtgca 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 12
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 1F3

<400> 12
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctgggacaa 60
atgggaagaa tctctcattt ctcctgcctg aaggacagac atgactttgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaacct cttcagcaca aaggactcat ctgttgcttg ggatgagagg 240
cttctagaca aactctatac tgaactttac cagcagctga atgacctgga agcctgtgtg 300
atgcaggagg tgtgggtggg agggactccc ctgatgaatg aggactccat cctggctgtg 360
agaaaatact tccaaagaat cactctctat ctgacagaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 13
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 4BE10

<400> 13
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacag 60
atgggacgaa tctctccttt ctcctgcctg aaggacagat atgatttcgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagata 180
atgcagcaga ccttcaatct cttcagcaca aagaactcat ctgctgcttg ggatgagacc 240
ctcctagaaa aattttccac tgaactttac cagcaactga atgaactgga agcatgtgtg 300
atacaggggg ttggggtgga agagactccc ctgatgaatg aggactccat cttggctgtg 360
aggaaatact tccaaagaat cactctttat ctgacagaga agaagtatag cccttgttcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 14
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2DD9

<400> 14
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatgct cctggcacaa 60
atgggaagaa tctccccttt ctcctgcctg aaggacagat atgatttcgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggaacagagc 240
ctcctagaaa aattttccac tggactctac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagaccccc ctgatgaatg aggactccat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctgacagaga agaagtatag cccttgttcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 15
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 3CA1

<400> 15
tgtgatctgc ctcagaccca cagccttggc aacaggaggg ccttgatact cctggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg attaccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aagaactcat ctgctgcttg ggatgagacc 240
ctcctagaaa aattttccac tgaactttac cagcaactga ataacctgga agcatgtgtg 300
atacaggagg ttgggatgga agagactccc ctgatgaatg tggactccat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctgacagaga agaagtatag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 16
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2F8

<400> 16
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggacgaa tctctccttt ctcctgcctg aaggacagat atgatttcgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atgcagcaga ccttcaatct cttcagcaca aagaactcat ctgctgcttg ggatgagacc 240
ctcctagaaa aattttccac tgaactttac cagcaactga atgaactgga agcatgtgtg 300
atacaggagg ttggggtgga agagactccc ctgatgaatg aggactccat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctgacagaga agaagtatag cccttgttcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 17
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 6CG3

<400> 17
tgtgatctgc ctcagaccca cagccttggt aacaagaggg ccatgatgct cctggcacaa 60
atgggaagaa cctctccttt ctcctgtctg aaggacagac atgactttgg attcccccag 120
gaggagtttg atggcaacca gttccagagg gctcaagcca tctttgtcct ccatgagatg 180
atccagcaga ccttcaattt cttcagcaca aaggactcat ctgctgcttg ggaacagagc 240
ctcctagaaa aattttccac tgaacttaac cagcagctga atgacctgga agcctgcgtg 300
atacaggaag ttggggtgga agagactccc ctgatgaatg aggactccat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctgacagaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 18
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 3CG7

<400> 18
tgtgatctgc ctcagaccca cagccttggt aacagtaggg ccttgatgct cctggcacaa 60
atgggaagaa tctccccttt ctcctgcctg aaggacagac atgatttcgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgcctt ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggaacagaac 240
ctcctagaaa aattttccac tgaactttac cagcaactga ataacctgga agcatgtgtg 300
atacaggagg ttgggatgga agagactccc ctgatgaatg tggactccat cctggctgtg 360
aggaagtact tccaaagaat cactctttat ctaatagaga ggaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 19
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 1D3

<400> 19

tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60 

atgggaagaa tctctcattt ctcctgcctg aaggacagac atgatttcgg attcccccag 120 

gaggagtttg atggccacca gttccagaag actcaagcca tctctgtcct ccatgagatg 180

atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctgcttg ggaacagagc 240 

ctcctagaaa aattttccac tgaactttac cagcaactga atgacctgga agcatgtgtg 300

atacaggagg ttggggtgga agagactccc ctgatgaatg aggactccat cctggctgtg 360 

aagaaatact tccaaagaat cactctttat ctgatggaga agaaatacag cccttgtgcc 420 

tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480

agattaagga ggaaggaa 498 



<210> 20
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2G4

<400> 20
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccatgatgct cctggcacaa 60
atgagcagaa tctctccttc ctcctgtctg atggacagac atgactttga atttccccag 120
gaggaatttg atgataaaca gttccagaag gctccagcca tctctgtcct ccatgaggtg 180
attcagcaga ccttcaatct cttcagcaca gaggactcat ctgctgcttg ggaacagacc 240
ctcctagaaa aattttccac tgaactttac cagcaactga atgacctgga agcatgtgtg 300
atgcaggagg agagggtggg agaaactccc ctgatgaatg cggactccat cttggctgtg 360
aggaaatact tccaaagaat cactctttat ctgacaaaga agaagtatag cccttgttcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 21
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 1A1
<400> 21
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggaagaa tctctcattt ctcctgcctg aaggacagat atgatttcgg attcccccag 120
gaggtgtttg atggcaacca gttccagaag gcccaagcca tctctgcctt ccatgagatg 180
atgcagcaga ccttcaatct cttcagcaca gaggactcat ctgctgcttg ggaacagagc 240
ctcctagaaa aattttccac tgaacttcac cagcaactga atgacctgga agcctgtgtg 300
atacaggagg ttggggtgga agagactccc ctgatgaatg aggactccat cctggctgtg 360
aggaaatact ttcaaagaat cactctttat ctaatggaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 22
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 1D10

<400> 22
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggaagaa tctctcattt ctcctgcctg aaggacagac atgatttcgg attcccccag 120
gaggagtttg atggccacca gttccagaag actcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctgcttg ggaacagagc 240
ctcctagaaa aattttccac tgaactttac cagcaactga atgacctgga agcatgtgtg 300
atacaggagg ttggggtgga agagactccc ctgatgaatg aggactccat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctgatggaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 23
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 1F6

<400> 23
tgtgatctgc ctcagaccca cagccttggt aacaggagga ctttgatgat aatggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg atttccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctacttg ggaacagagc 240
ctcctagaaa aattttccac tgaacttaac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg ctggggtgga agagactccc ctgatgaatg tggactccat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctaacagaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 24
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2A10

<400> 24
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggaagaa tctctcattt ctcctgcctg aaggacagat atgatttcgg attcccccag 120
gaggtgtttg atggcaacca gttccagaag gctcaagcca tctctgcctt ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctacttg ggaacagagc 240
ctcctagaaa aattttccac tgaactttac cagcaactga ataacctgga agcatgtgtg 300
atacaggagg ttggggtgga agagactccc ctgatgaatg aggactccat cctggctgtg 360
aggaaatact ttcaaagaat cactctttat ctgatggaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 25
<211> 498
<212> DNA
<213> Artificial Sequence



<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2C3

<400> 25
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg atttcctcag 120
gaggagtttg atggcaacca gtcccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggactcat ctgatacttg ggatgcgacc 240
cttttagaaa aattttccac tgaacttaac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagaccccc ctgatgaatg tggactccat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctgacagaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 26
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2D1

<400> 26
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggacgaa tctctccttt ctcctgcctg aaggacagac aagactttgg attcccccag 120
gaggagtttg atggcaaccg gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aagaactcat ctgctgcttg ggaacagagc 240
ctcctagaaa aattttccac tgaactctac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagaccccc ctgatgaatg aggactccat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctaatagaga ggaaatacag cccttgtgca 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 27
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2D10

<400> 27
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggaagag tctctccttt ctcctgcctg aaggacagac atgactttgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgcctt ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctacttg ggaacagagc 240
ctcctagaaa aattttccac tgaactttac cagcaactga ataacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagactccc ctgatgaatg tggactccat cctggctgtg 360
aagaaatact tccgaagaat cactctctat ctgacagaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 28
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2D7

<400> 28
tgtgatctgc ctcagaccca cagccttggt aacaggcggg ccttgatact cctggcacaa 60
atgggaagaa tctctccttt ctcctgtctg aaggacagac atgacttcag atttccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctacttg ggaacagagc 240
ctcctagaaa aattttccac tgaactttac cagcaactga ataacctgga agcttgcgtg 300
atacaggagg ttggggtgga agagactccc ctgatgaatg tggactctat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctgacagaga ggaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 29 
<211> 498
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic DNA



<220> 

<223> Clone ID 2D9 
<400> 29

tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60 
atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg attcccccag 120 
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180 
atccagcaga ctttcaatct cttcagcaca aaggactcat ctgctacttg ggaacagagc 240
ctcctagaaa aattttccac tgaacttaac cagcagctga atgacctgga agcctgcgtg 300 
atacaggagg ttggggtgga agagactccc ctggtgaatg tggactccat cctggctgtg 360 
aagaaatact tccaaagaat cactctttat ctgacagaga agaaatacag cccttgtgcc 420 
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480 
agattaagga ggaaggaa 498

<210> 30 
<211> 498 
<212> DNA 
<213> Artificial Sequence

<220> 

<223> Description of Artificial Sequence: Synthetic DNA 

<220>

<223> Clone ID 2DA2 

<400> 30 

tgtgatctgc ctcagaccca cagccttggt aacaggaggc ccttgatact cctggcacaa 60 

atgggaagaa tctctccttt ctcctgcctg aaggacagac aggacttcgg attcccccag 120

gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180 

atgcagcaga ccttcaatct cttcagcaca aagaactcat ctgctgcttg ggaacagagc 240 

ctcctagaaa aattttccac tgaactccac cagcaactga atgaactgga agcatgtgtg 300 

atacaggagg ttggggtgga agagactccc ctgatgaatg tggactccat cctggctgtg 360 

aagaaatact tccaaagaat cactctttat ctaatagaga ggaaatacag cccttgtgca 420

tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480 
agattaagga ggaaggaa 498 

<210> 31 
<211> 498
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA
<220> 

<223> Clone ID 2DH9 
<400> 31

tgtgatctgc ctcagaccca cagccctggt aacaggaggg ccttgatgct cctggcacaa 60 
atgggacgaa tctctccttt ctcctgcctg aaggacagat atgatttcgg attcccccag 120 
ggggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180 
atgcagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggaacagagc 240 
ctcctagaaa aattttccac tgaactctac cggcagctga atgacctgga agcctgtgtg 300
atacaggagg ttggggtgga agagaccccc ctgatgaatg tggactccat cctggctgtg 360
aggaagtact tccaaagaat cactctttat ctgacagaga agaagcatag cccttgttcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480 
agattaagga ggaaggaa 498 






<210> 32 
<211> 498 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Synthetic DNA 

<220>

<223> Clone ID 2G11 

<400> 32 

tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60 

atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg acttccccag 120

gaggagtttg atggcaacca gttccagaag actcaagcca tctctgtcct ccatgagatg 180 

atccagcaga ccttcaatct cttcagcaca aaggactcat ctgatacttg ggaacagagc 240 

ctcctagaaa aattctacat tgaacttttc cagcagctga atgacctgga agcctgcgtg 300 

atacaggagg ttggggtgga agagactccc ctgatgaatg tggactccat cctggctgtg 360 

agaaaatact tccaaagaat cactctttat ctgacagagg agaaatacag cccttgtgcc 420

tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480 

agattaagga ggaaggaa 498 

<210> 33
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2G12

<400> 33
tgtgatctgc ctcagaccca cagccttggt aacaggagga ctttgatgct catggcacaa 60
atgaggagaa tctctccttt cccccgcctg aaggacagat atgatttcgg attcccccag 120
gaggtgtttg atggcaacca gttccagaag gctcaagcta tcttcctttt ccatgagatg 180
atgcagcaga ccttcaatct cttcagcaca aagaactcat ctgctgcttg ggatgagacc 240
ctcctagaca aattctacac tgaactctac cagcagctga atgacttgga agcctgtgtg 300
atgcaggagg ggagggtggg agaaactccc ctgatgaatg cggactccat cttggctgtg 360
aagaaatact tccgaagaat cactctctat ctgacagaga agaaatacag cccttgtgcc 420
tgggaggctg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 34
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA



<220>
<223> Clone ID 2H9

<400> 34
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctacttg ggaacagagc 240
ctcctagaaa aattttccac tgaacttaac cagcagctga atgacctaga agcctgtgtg 300
acacaggagg ttggggtgga agagactccc ctgatgaatg aggactctat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctgacagaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 35
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 6BC11

<400> 35
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagat atgatttcgg attcccccag 120
gaggagtttg atggcaacca gctccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggaacagagc 240
ctcctagaaa aattttccac tgaacttaac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg ttggagtgga agagactccc ctgatgaatg tggactccat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctgacagaga ggaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 36
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2DH12

<400> 36
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Met
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Gln Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Thr
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Lys Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Ile Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 37
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2CA3

<400> 37
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asp Arg Arg Ala Met Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Glu Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Gly Glu Thr Pro Leu Met
100 105 110

Asn Gly Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Ile Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 38
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 4AB9

<400> 38
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Arg Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Met Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asn Ser Ser Ala Ala Trp Asp Glu Thr
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Glu Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ser Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 39
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2DA4

<400> 39
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Met
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Gln Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Ser Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Met Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Ile Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 40
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 3DA11

<400> 40
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Val
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Ile Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 41
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2DB11

<400> 41
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Met
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Ile Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165

<210> 42
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2CA5

<400> 42
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Gln Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Arg Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asn Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Ile Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 43
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2G6

<400> 43
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Asn Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Pro Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 44
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 3AH7

<400> 44
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Arg Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Ser Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu His Gln Gln Leu Asn Glu Leu
85 90 95

Glu Ala Cys Val Val Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Leu Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 45
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2G5

<400> 45
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Met
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Gln Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Ile Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 46
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2BA8

<400> 46
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Ile Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 47
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 1F3

<400> 47
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Gly Gln Met Gly Arg Ile Ser His Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Val Ala Trp Asp Glu Arg
65 70 75 80

Leu Leu Asp Lys Leu Tyr Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Met Gln Glu Val Trp Val Gly Gly Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165

<210> 48
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 4BE10

<400> 48
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Ile Met Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asn Ser Ser Ala Ala Trp Asp Glu Thr
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Glu Leu
85 90 95

Glu Ala Cys Val Ile Gln Gly Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ser Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 49
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2DD9

<400> 49
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Met
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Gly Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ser Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 50
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 3CA1

<400> 50
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Leu Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asn Ser Ser Ala Ala Trp Asp Glu Thr
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asn Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Met Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 51
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2F8

<400> 51
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Met Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asn Ser Ser Ala Ala Trp Asp Glu Thr
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Glu Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ser Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 52 
55 <211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 
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5 



15 



30 



40 



45 



<223> Description of Artificial Sequence: Synthetic amino acid 
<220> 

<223> Clone ID 6CG3 
<400> 52 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly Asn Lys Arg Ala Met Met 
15 10 15 



10 Leu Leu Ala Gin Met Gly Arg Thr Ser Pro Phe Ser Cys Leu Lys Asp 

20 25 30 



Arg His Asp Phe Gly Phe Pro Gin Glu Glu Phe Asp Gly Asn Gin Phe 
35 40 45 

Gin Arg Ala Gin Ala He Phe Val Leu His Glu Met He Gin Gin Thr 
50 55 60 



Phe Asn Phe Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gin Ser 
20 65 70 75 80 

Leu Leu Glu Lys Phe Ser Thr Glu Leu Asn Gin Gin Leu Asn Asp Leu 

85 90 95 

25 Glu Ala Cys Val He Gin Glu Val Gly Val Glu Glu Thr Pro Leu Met 

100 105 HO 



Asn Glu Asp Ser He Leu Ala Val Lys Lys Tyr Phe Gin Arg He Thr 
115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 
130 135 140 



Arg Ala Glu He Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Lys 
35 145 150 155 160 

Arg Leu Arg Arg Lys Glu 

165 



<210> 53
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 3CG7

<400> 53
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Ser Arg Ala Leu Met
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Ala Phe His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Asn
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asn Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Met Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Ile Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 54
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 1D3

<400> 54
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser His Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly His Gln Phe
35 40 45

Gln Lys Thr Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Met Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 55
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2G4

<400> 55
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Met Met
1 5 10 15

Leu Leu Ala Gln Met Ser Arg Ile Ser Pro Ser Ser Cys Leu Met Asp
20 25 30

Arg His Asp Phe Glu Phe Pro Gln Glu Glu Phe Asp Asp Lys Gln Phe
35 40 45

Gln Lys Ala Pro Ala Ile Ser Val Leu His Glu Val Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Glu Asp Ser Ser Ala Ala Trp Glu Gln Thr
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Met Gln Glu Glu Arg Val Gly Glu Thr Pro Leu Met
100 105 110

Asn Ala Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Lys Lys Lys Tyr Ser Pro Cys Ser Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165
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<210> 56 
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 1A1

<400> 56
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser His Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Val Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Ala Phe His Glu Met Met Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Glu Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu His Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Met Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 57
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 1D10

<400> 57
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Arg Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Leu
35 40 45

Gln Lys Thr Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Asn Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Gly Val Gly Val Glu Glu Thr Pro Pro Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 58
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 1F6

<400> 58
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Thr Leu Met
1 5 10 15

Ile Met Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Asn Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Ala Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 59
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2A10

<400> 59
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser His Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Val Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Ala Phe His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asn Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Met Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 60
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2C3

<400> 60
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Ser
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Asp Thr Trp Asp Ala Thr
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Asn Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 61
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2D1

<400> 61
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Arg Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Ala Phe His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asn Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Met Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 62
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2D10

<400> 62
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Val Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Ala Phe His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asn Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Arg Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 63
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2D7

<400> 63
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Arg Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asn Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 64
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2D9

<400> 64
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Asn Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Val
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165
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<210> 65 
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2DA2

<400> 65
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Pro Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Gln Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Met Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asn Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu His Gln Gln Leu Asn Glu Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Ile Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 66
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2DH9

<400> 66
Cys Asp Leu Pro Gln Thr His Ser Pro Gly Asn Arg Arg Ala Leu Met
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Gly Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Met Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Arg Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys His Ser Pro Cys Ser Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 67
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2G11

<400> 67
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Leu Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Thr Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Asp Thr Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Tyr Ile Glu Leu Phe Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Glu Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 68
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2G12

<400> 68
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Thr Leu Met
1 5 10 15

Leu Met Ala Gln Met Arg Arg Ile Ser Pro Phe Pro Arg Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Val Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Phe Leu Phe His Glu Met Met Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asn Ser Ser Ala Ala Trp Asp Glu Thr
65 70 75 80

Leu Leu Asp Lys Phe Tyr Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Met Gln Glu Gly Arg Val Gly Glu Thr Pro Leu Met
100 105 110

Asn Ala Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Arg Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Ala Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 69
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2H9

<400> 69
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Asn Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Thr Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165

<210> 70
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 6BC11

<400> 70
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Leu
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Asn Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 71
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID tl9bb

<220>
<221> MOD_RES
<222> (11)
<223> N or D

<220>
<221> MOD_RES
<222> (12)
<223> R, S, or K

<220>
<221> MOD_RES
<222> (15)
<223> L or M

<220>
<221> MOD_RES
<222> (16)
<223> I, M or V

<220>
<221> MOD_RES
<222> (19)
<223> A or G

<220>
<221> MOD_RES
<222> (22)
<223> G or R

<220>
<221> MOD_RES
<222> (24)
<223> I or T

<220>
<221> MOD_RES
<222> (26)
<223> P or H

<220>
<221> MOD_RES
<222> (34)
<223> H, Y or Q

<220>
<221> MOD_RES
<222> (38)
<223> F or L

<220>
<221> MOD_RES
<222> (40)
<223> Q or R

<220>
<221> MOD_RES
<222> (45)
<223> G or S

<220>
<221> MOD_RES
<222> (46)
<223> N or H

<220>
<221> MOD_RES
<222> (47)
<223> Q or R

<220>
<221> MOD_RES
<222> (50)
<223> K or R

<220>
<221> MOD_RES
<222> (51)
<223> A or T

<220>
<221> MOD_RES
<222> (55)
<223> S or F

<220>
<221> MOD_RES
<222> (56)
<223> V or A

<220>
<221> MOD_RES
<222> (57)
<223> L or F

<220>
<221> MOD_RES
<222> (60)
<223> M or I

<220>
<221> MOD_RES
<222> (61)
<223> I or M

<220>
<221> MOD_RES
<222> (67)
<223> L or F

<220>
<221> MOD_RES
<222> (72)
<223> D or N

<220>
<221> MOD_RES
<222> (75)
<223> A or V

<220>
<221> MOD_RES
<222> (76)
<223> A or T

<220>
<221> MOD_RES
<222> (78)
<223> E or D

<220>
<221> MOD_RES
<222> (79)
<223> Q or E

<220>
<221> MOD_RES
<222> (80)
<223> S, R, T, or N

<220>
<221> MOD_RES
<222> (83)
<223> E or D

<220>
<221> MOD_RES
<222> (85)
<223> F or L

<220>
<221> MOD_RES
<222> (86)
<223> S or Y

<220>
<221> MOD_RES
<222> (88)
<223> E or G

<220>
<221> MOD_RES
<222> (90)
<223> Y, H, or N

<220>
<221> MOD_RES
<222> (95)
<223> D, E, or N

<220>
<221> MOD_RES
<222> (101)
<223> I, M, or V

<220>
<221> MOD_RES
<222> (103)
<223> E or G

<220>
<221> MOD_RES
<222> (105)
<223> G or W

<220>
<221> MOD_RES
<222> (106)
<223> V or M

<220>
<221> MOD_RES
<222> (107)
<223> E, G, or K

<220>
<221> MOD_RES
<222> (108)
<223> E or G

<220>
<221> MOD_RES
<222> (114)
<223> V, E, or G

<220>
<221> MOD_RES
<222> (116)
<223> S or P

<220>
<221> MOD_RES
<222> (121)
<223> K or R

<220>
<221> MOD_RES
<222> (124)
<223> F or L

<220>
<221> MOD_RES
<222> (132)
<223> T, I, or M

<220>
<221> MOD_RES
<222> (134)
<223> K or R

<220>
<221> MOD_RES
<222> (140)
<223> A or S

<400> 71
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Xaa Xaa Arg Ala Xaa Xaa
1 5 10 15

Leu Leu Xaa Gln Met Xaa Arg Xaa Ser Xaa Phe Ser Cys Leu Lys Asp
20 25 30

Arg Xaa Asp Phe Gly Xaa Pro Xaa Glu Glu Phe Asp Xaa Xaa Xaa Phe
35 40 45

Gln Xaa Xaa Gln Ala Ile Xaa Xaa Xaa His Glu Xaa Xaa Gln Gln Thr
50 55 60

Phe Asn Xaa Phe Ser Thr Lys Xaa Ser Ser Xaa Xaa Trp Xaa Xaa Xaa
65 70 75 80

Leu Leu Xaa Lys Xaa Xaa Thr Xaa Leu Xaa Gln Gln Leu Asn Xaa Leu
85 90 95

Glu Ala Cys Val Xaa Gln Xaa Val Xaa Xaa Xaa Xaa Thr Pro Leu Met
100 105 110

Asn Xaa Asp Xaa Ile Leu Ala Val Xaa Lys Tyr Xaa Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Xaa Glu Xaa Lys Tyr Ser Pro Cys Xaa Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 72
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID CH1.1

<400> 72
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggaagaa tctctccttt ctcctgtctg atggacagac atgactttgg atttccccag 120
gaggagtttg atgacaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccaacaga ccttcaatct cttcagcaca aaggactcat ctgctacttg ggatgagaca 240
cttctagaca aattctacac tgaactttac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagactccc ctgatgaatg aggactccat cttggctgtg 360
aagaaatact tccgaagaat cactctctat ctgacagaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 73
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID CH1.2

<400> 73
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaaggca tctctgtcct ccatgagatg 180
atccagcaga ccttccatct cttcagcaca aaggactcat ctgctacttg ggaacagagc 240
ctcctagaaa aattttccac tgaacttaac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagactccc ctgatgaatg tggactccat cctggctgtg 360
aagaaatact tccgaagaat cactctttat ctgacagaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 74
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID CH1.3

<400> 74
tgtgatctgc ctcagaccca cagccttggt aacaggagga ctttgatgat aatggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg atttcctcag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctacttg ggatgagaca 240
cttctagaca aattctacac tgaactttac cagcagctga atgacctgga agcctgtatg 300
atgcaggagg ttggagtgga agacactcct ctgatgaatg tggactctat cctgactgtg 360
agaaaatact ttcgaagaat cactctttat ctgacagaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 75
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID CH1.4

<400> 75
tgtgatctgc ctcagaccca cagcctgggt aataggaggg ccttgatact cctggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg attcccccag 120
gaggagtttg gtggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca gaggactcat ctgctgcttg ggatgagacc 240
ctcctagaca aattctacat tgaacttttc cagcaactga atgacctgga agcctgtgtg 300
atgcaggagg agagggtggg agaaactccc ctgatgaatg cggactccat cttggctgtg 360
aagaaatact tccaaagaat cactctttat ctgacagaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 76
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID CH2.1

<400> 76
tgtgatctgc ctcagaccca cagccttggt aacaggagga ctttgatgat aatggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg atttcctcag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctacttg ggatgagaca 240
cttctagaca aattctacac tgaactttac cagcagctga atgacctgga agcctgtatg 300
atacaggagg ttggggtgga agagactccc ctgatgaatg aggactccat cttggctgtg 360
aagaaatact tccgaagaat cactctctat ctgacagaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 77
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID CH2.2

<400> 77
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggaagaa tctctccttt ctcctgtctg atggacagac atgactttgg atttccccag 120
gaggagtttg atgacaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccaacaga ccttcaatct cttcagcaca aaggactcat ctgctacttg ggatgagaca 240
cttctagaca aattctacac tgaactttac cagcagctga atgacctgga agcctgtatg 300
atgcaggagg ttggagtgga agacactcct ctgatgaatg tggactctat cctgactgtg 360
aagaaatact tccgaagaat cactctttat ctgacagaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 78
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID CH2.3

<400> 78
tgtgatctgc ctcagaccca cagccttggt aacaggagga ctttgatgat aatggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg atttcctcag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctacttg ggatgagaca 240
cttctagaca aattctacac tgaactttac cagcagctga atgacctgga agcctgtatg 300
atgcaggagg ttggagtgga agacactcct ctgatgaatg aggactccat cttggctgtg 360
aagaaatact tccgaagaat cactctctat ctgacagaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tctcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 79
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID CH1.1

<400> 79
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Met Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Asp Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Asp Glu Thr
65 70 75 80

Leu Leu Asp Lys Phe Tyr Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Arg Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165

<210> 80
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID CH1.2

<400> 80
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Gly Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe His Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Asn Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Arg Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 81
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID CH1.3

<400> 81
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Thr Leu Met
1 5 10 15

Ile Met Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Asp Glu Thr
65 70 75 80

Leu Leu Asp Lys Phe Tyr Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Met Met Gln Glu Val Gly Val Glu Asp Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Thr Val Arg Lys Tyr Phe Arg Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165


<210> 82
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID CH1.4

<400> 82
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Gly Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Glu Asp Ser Ser Ala Ala Trp Asp Glu Thr
65 70 75 80

Leu Leu Asp Lys Phe Tyr Ile Glu Leu Phe Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Met Gln Glu Glu Arg Val Gly Glu Thr Pro Leu Met
100 105 110

Asn Ala Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 83
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID CH2.1

<400> 83
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Thr Leu Met
1 5 10 15

Ile Met Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Asp Glu Thr
65 70 75 80

Leu Leu Asp Lys Phe Tyr Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Met Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Arg Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 84
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID CH2.2

<400> 84
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Met Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Asp Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Asp Glu Thr
65 70 75 80

Leu Leu Asp Lys Phe Tyr Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Met Met Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Thr Val Lys Lys Tyr Phe Arg Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165

WO 01/25438 PCT/US00/27781

<210> 85
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID CH2.3

<400> 85
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Thr Leu Met
1 5 10 15

Ile Met Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Asp Glu Thr
65 70 75 80

Leu Leu Asp Lys Phe Tyr Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Met Met Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Arg Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 86
<211> 15
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<400> 86
tgcgacttac cacaa 15

<210> 87
<211> 26
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<400> 87
Trp Glu Val Val Arg Ser Glu Ile Met Arg Ser Phe Ser Tyr Ser Thr
1 5 10 15

Asn Leu Gln Arg Arg Leu Arg Arg Lys Asp
20 25

<210> 88
<211> 26
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<400> 88
Trp Glu Leu Val Arg Ala Glu Ile Val Arg Ser Phe Ser Phe Ser Thr
1 5 10 15

Asn Leu Asn Lys Arg Leu Arg Lys Lys Glu
20 25

MUTANT RECOMBINANT ADENO-ASSOCIATED VIRUSES
RELATED APPLICATIONS

Benefit of priority is claimed to U.S. application Serial No.
10/022,390, filed December 17, 2001, to Manuel Vega, Lila Drittanti and
Marjorie Flaux entitled "MUTANT RECOMBINANT ADENO-ASSOCIATED
VIRUSES." Benefit of priority is also claimed to U.S. provisional patent
application Serial No. 60/315,382, filed August 27, 2001, to Manuel
Vega, Lila Drittanti, and Marjorie Flaux entitled "HIGH THROUGHPUT
DIRECTED EVOLUTION BY RATIONAL MUTAGENESIS." Where
permitted, the subject matter of each of these applications is
incorporated in its entirety by reference thereto.
FIELD OF INVENTION 

Mutant adeno-associated virus Rep proteins, recombinant viruses
that express the proteins and nucleic acid molecules encoding the Rep
proteins are provided. Uses of the recombinant viruses for treatment of
diseases and vectors for gene therapy are also provided.
BACKGROUND 

Adeno-associated virus (AAV) is a defective and non-pathogenic
parvovirus that requires co-infection with either adenovirus or a herpes
virus, which provide helper functions, for its growth and multiplication.
There is an extensive body of knowledge regarding AAV biology and
genetics (see, e.g., Weitzman et al. (1996) J. Virol. 70:2240-2248;
Walker et al. (1997) J. Virol. 71:2722-2730; Urabe et al. (1999)
J. Virol. 73:2682-2693; Davis et al. (2000) J. Virol. 74:2936-2942;
Yoon et al. (2001) J. Virol. 75:3230-3239; Deng et al. (1992) Anal.
Biochem. 200:81-85; Drittanti et al. (2000) Gene Therapy 7:924-929;
Srivastava et al. (1983) J. Virol. 45:555-564; Hermonat et al. (1984) J.
Virol. 57:329-339; Chejanovsky et al. (1989) Virology 173:120-128;
Chejanovsky et al. (1990) J. Virol. 64:1764-1770; Owens et al. (1991)
Virology 784:14-22; Owens et al. (1992) J. Virol. 55:1236-1240;
Qicheng Yang et al. (1992) J. Virol. 55:6058-6069; Qicheng Yang et al.
(1993) J. Virol. 57:4442-4447; Owens et al. (1993) J. Virol. 52:997-1005;
Sirkka et al. (1994) J. Virol. 55:2947-2957; Ramesh et al. (1995)
Biochem. Biophy. Res. Com. 210(3):717-725; Sirkka (1995) J.
Virol. 55:6787-6796; Sirkka et al. (1996) Biochem. Biophy. Res. Com.
220:294-299; Ryan et al. (1996) J. Virol. 70:1542-1553; Weitzman et al.
(1996) J. Virol. 70:2440-2448; Walker et al. (1997) J. Virol. 71:2722-2730;
Walker et al. (1997) J. Virol. 71:6996-7004; Davis et al. (1999) J.
Virol. 73:2084-2093; Urabe et al. (1999) J. Virol. 73:2682-2693; Gavin
et al. (1999) J. Virol. 73:9433-9445; Davis et al. (2000) J. Virol.
74:2936-2942; Pei Wu et al. (2000) J. Virol. 74:8635-8647; Alessandro
Marcello et al. (2000) J. Virol. 74:9090-9098). AAV are members of the
family Parvoviridae and are assigned to the genus Dependovirus.
Members of this genus are small, non-enveloped, icosahedral viruses with
linear, single-stranded DNA genomes, and have been isolated from many
species ranging from insects to humans.

AAV can either remain latent after integration into host chromatin 
or replicate following infection. Without co-infection, AAV can enter host 
cells and preferentially integrate at a specific site on the q arm of
chromosome 19 in the human genome. 

The AAV genome contains 4975 nucleotides and the coding 
sequence is flanked by two inverted terminal repeats (ITRs) on either side 
that are the only sequences in cis required for viral assembly and 
replication. The ITRs contain palindromic sequences, which form a hairpin
secondary structure, containing the viral origins of replication. The ITRs 
are organized in three segments: the Rep binding site (RBS), the terminal 
resolution site (TRS), and a spacer region separating the RBS from 
the TRS.

Regulation of AAV genes is complex and involves positive and
negative regulation of viral transcription. For example, the regulatory
proteins Rep 78 and Rep 68 interact with viral promoters to establish a
feedback loop (Beaton et al. (1989) J. Virol. 53:4450-4454; Hermonat
(1994) Cancer Lett. 81:129-136). Expression from the p5 and p19
promoters is negatively regulated in trans by these proteins. Rep 78 and
68, which are required for this regulation, bind to inverted terminal
repeats (ITRs; Ashktorab et al. (1989) J. Virol. 53:3034-3039) in a site-
and strand-specific manner, in vivo and in vitro. This binding to ITRs
induces a cleavage at the TRS and permits the replication of the hairpin
structure, thus illustrating the Rep helicase and endonuclease activities
(Im et al. (1990) Cell 67:447-457; and Walker et al. (1997) J. Virol.
71:6996-7004), and the role of these non-structural proteins in the initial
steps of DNA replication (Hermonat et al. (1984) J. Virol. 52:329-339).
Rep 52 and 40, the two minor forms of the Rep proteins, do not bind to
ITRs and are dispensable for viral DNA replication and site-specific
integration (Im et al. (1992) J. Virol. 66:1119-1128; Ni et al. (1994) J.
Virol. 65:1128-1138).

The genome (see FIG. 1) is organized into two open reading
frames (ORFs, designated left and right) that encode structural capsid
proteins (Cap) and non-structural proteins (Rep). There are three
promoters: p5 (from nucleotides 255 to 261: TATTTAA), p19 (from
nucleotides 843 to 849: TATTTAA) and p40 (from nucleotides 1822 to
1827: ATATAA). The right-side ORF (see FIG. 1) encodes three capsid
structural proteins (VP1-3). These three proteins, which are encoded by
overlapping DNA, result from differential splicing and the use of an
unusual initiator codon (Cassinoti et al. (1988) Virology 767:176-184).
Expression of the capsid genes is regulated by the p40 promoter. Capsid
proteins VP1, VP2 and VP3 initiate from the p40 promoter. VP1 uses an
alternate splice acceptor at nucleotide 2201, whereas VP2 and VP3 are
derived from the same transcription unit, but VP2 uses an ACG triplet as
an initiation codon upstream from the start of VP3. On the left side of
the genome, two promoters, p5 and p19, direct expression of four
regulatory proteins. The left flanking sequence also uses a differential
splicing mechanism (Mendelson et al. (1986) J. Virol. 50:823-832) to
encode the Rep proteins, designated Rep 78, 68, 52 and 40 on the basis
of molecular weight. Rep 78 and 68 are translated from a transcript
produced from the p5 promoter and are produced from the unspliced and
spliced form, respectively, of the transcript. Rep 52 and 40 are the
translation products of unspliced and spliced transcripts from the p19
promoter.
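
The promoter and splicing relationships described above can be summarized in a small lookup structure. The following Python sketch is illustrative only: the class and function names (RepIsoform, isoforms_from, motif_length_matches) are assumptions introduced here, and the coordinates and motifs are simply those quoted in the paragraph above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RepIsoform:
    """One Rep protein, described by its promoter and transcript form."""
    name: str
    promoter: str
    spliced: bool


# Rep isoforms as described in the text: p5 gives Rep 78 (unspliced) and
# Rep 68 (spliced); p19 gives Rep 52 (unspliced) and Rep 40 (spliced).
REP_ISOFORMS = (
    RepIsoform("Rep 78", "p5", False),
    RepIsoform("Rep 68", "p5", True),
    RepIsoform("Rep 52", "p19", False),
    RepIsoform("Rep 40", "p19", True),
)

# Promoter elements as (start, end, motif), 1-based inclusive nucleotide
# positions, copied from the text above.
PROMOTERS = {
    "p5": (255, 261, "TATTTAA"),
    "p19": (843, 849, "TATTTAA"),
    "p40": (1822, 1827, "ATATAA"),
}


def isoforms_from(promoter):
    """Names of the Rep isoforms translated from transcripts of a promoter."""
    return [r.name for r in REP_ISOFORMS if r.promoter == promoter]


def motif_length_matches(promoter):
    """Check that the quoted coordinate range spans exactly the quoted motif."""
    start, end, motif = PROMOTERS[promoter]
    return end - start + 1 == len(motif)
```

For example, isoforms_from("p19") lists the two minor Rep forms, and isoforms_from("p40") is empty because p40 drives the capsid genes rather than Rep.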

AAV and rAAV have many applications, including use as a gene
transfer vector, for introducing heterologous nucleic acid into cells and for
genetic therapy. Advances have been made in the production of high-titer
rAAV stocks through to the transition to human clinical trials, but
improvement of rAAV production will be complemented with special
attention to clinical applications of rAAV vectors as a successful gene
therapy approach. Productivity of rAAV (i.e., the amount of vector
particles that can be obtained per unitary manufacturing operation) is one
of the rate-limiting steps in the further development of rAAV as a gene
therapy vector. Methods for high throughput production and screening of
rAAV have been developed (see, e.g., Drittanti et al. (2000) Gene Therapy
7:924-929). Briefly, as with the other steps in methods provided herein, the
plasmid preparation, transfection, virus productivity and titer and
biological activity assessment are intended to be performed in an
automatable high throughput format, such as in a 96 well or loci format
(or other number of wells or multiples of 96, such as 384, 1536 . . .
9600, 9984 . . . well or loci formats).
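
As a quick check of the "multiples of 96" formats named above, the following Python sketch confirms that each listed well count is a whole number of 96-well units. The function names are hypothetical and are introduced here only for illustration.

```python
def is_multiple_of_96(n_wells):
    """True if a well (or locus) count is a positive multiple of 96."""
    return n_wells > 0 and n_wells % 96 == 0


def units_of_96(n_wells):
    """Number of 96-well units in a format; rejects other counts."""
    if not is_multiple_of_96(n_wells):
        raise ValueError(f"{n_wells} is not a multiple of 96")
    return n_wells // 96


# Formats named in the text above.
FORMATS = [96, 384, 1536, 9600, 9984]
UNITS = {n: units_of_96(n) for n in FORMATS}
```

Under this reading, 384 corresponds to 4 units, 1536 to 16, 9600 to 100 and 9984 to 104.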
SUMMARY

Mutant AAV Rep proteins, nucleic acid molecules encoding such
proteins, and rAAV that encode the proteins are provided. Among the
Rep proteins are those that result in increased rAAV production in rAAV
that encode such mutants, thereby, among a variety of advantages,
offering a solution to the need in the gene therapy industry to increase
the production of therapeutic vectors without up-scaling manufacturing.
Methods of gene therapy using the rAAV are provided.

Directed evolution methods provided in co-pending U.S. provisional
application Serial No. 60/315,382, filed as U.S. application Serial No.
10/022,249, and described herein have been used to identify amino acid
"hit" positions in adeno-associated virus (AAV) Rep proteins that are
relevant for AAV or rAAV production. Those amino acid positions are
selected such that a change in the amino acid leads to a change in protein
activity, either to lower activity or to higher activity, compared to
native-sequence Rep proteins. The hit positions were then used to generate
further mutants designated "leads." Provided herein are the resulting
mutant Rep proteins that result in either higher or lower levels of AAV or
rAAV virus compared to the wild-type (native) Rep protein(s). Nucleic
acid molecules that encode the mutant Rep proteins are also provided.

Also provided are rAAV that contain the nucleic acid molecules and 
methods that use the rAAV to produce the mutant Rep. Cell-free (in 
vitro) and intracellular methods are provided. Cells containing the rAAV 
are also provided. 

Among the Rep mutants provided herein, in addition to Rep
mutants that enhance AAV production, are those that inhibit
papillomavirus (PV) and PV-associated diseases, including certain cancers,
and human immunodeficiency virus (HIV) and HIV-associated diseases.
Methods of treating such diseases are provided.
DESCRIPTION OF THE FIGURES

FIGURE 1 shows the genetic map of AAV, including the location of
promoters and transcripts; amino acid 1 of the Rep 78 gene is at
nucleotide 321 in the AAV-2 genome.

FIGURES 2A and 2B depict "HITS" and "LEADS," respectively, for
identification of AAV rep mutants "evolved" for increased activity.

FIGURES 3A and 3B show the alignment of amino acid sequences
of Rep78 among the AAV-1, AAV-6, AAV-3, AAV-3B, AAV-4, AAV-2 and AAV-5
sequences, respectively; the hit positions with 100 percent homology
among the serotypes are in bold italics; where the position is different
(compared to AAV-2, no. 6 in the Figure) in a particular serotype, it is in
bold; a sequence indicating relative conservation of sequences among
the serotypes is labeled "C".

Legend:

1 is AAV-1, 2 is AAV-6, 3 is AAV-3, 4 is AAV-3B,
5 is AAV-4, 6 is AAV-2, and 7 is AAV-5;

"." where the amino acid is present > 20%;

":" where the amino acid is present > 40%;

"+" where the amino acid is present > 60%;

"*" where the amino acid is present > 80%; and

where the amino acid is the same amongst all
serotypes depicted it is represented by its single letter
code.

DETAILED DESCRIPTION 
A. Definitions

Unless defined otherwise, all technical and scientific terms used 
herein have the same meaning as is commonly understood by one of skill 
in the art to which this invention belongs. All patents, patent 
applications, published applications and publications, Genbank sequences, 
websites and other published materials referred to throughout the entire
disclosure herein are, unless noted otherwise, incorporated by reference
in their entirety. In the event that there are a plurality of definitions for
terms herein, those in this section prevail. 

As used herein, directed evolution refers to methods that adapt
natural proteins or protein domains to work in new chemical or biological
environments and/or to elicit new functions. It is a more broad-based
technology than DNA shuffling.

As used herein, high-throughput screening (HTS) refers to
processes that test a large number of samples, such as samples of test
proteins or cells containing nucleic acids encoding the proteins of interest,
to identify structures of interest or to identify test compounds that
interact with the variant proteins or cells containing them. HTS
operations are amenable to automation and are typically computerized to
handle sample preparation, assay procedures and the subsequent
processing of large volumes of data.

As used herein, DNA shuffling is a PCR-based technology that
produces random rearrangements between two or more sequence-related
genes to generate related, although different, variants of a given gene.

As used herein, "hits" are mutant proteins that have an alteration in
any attribute, chemical, physical or biological property in which such
alteration is sought. In the methods herein, hits are generally generated
by systematically replacing each amino acid in a protein or a domain
thereof with a selected amino acid, typically alanine, glycine, serine or
any amino acid, as long as each residue is replaced with the same
residue. Hits may be generated by other methods known to those of skill
in the art and tested by the high-throughput methods herein. For purposes
herein a hit typically has activity with respect to the function of interest
that differs by at least 10%, 20%, 30% or more from the wild type or
native protein. The desired alteration, which is generally a reduction in
activity, will depend upon the function or property of interest.
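
By way of illustration only, the systematic single-residue replacement described above can be sketched in Python. The function names, the choice to skip positions that already carry the scanning residue, and the use of a relative 10% cutoff are assumptions made for this sketch and are not taken from the methods herein.

```python
def scan_variants(sequence: str, replacement: str = "A") -> list[tuple[int, str]]:
    """Return (1-based position, variant sequence) for every single substitution.

    Each residue is replaced, one position at a time, with the same scanning
    residue. Positions that already hold the scanning residue are skipped
    (an assumption of this sketch), since they would reproduce the native sequence.
    """
    variants = []
    for i, residue in enumerate(sequence):
        if residue == replacement:
            continue
        variants.append((i + 1, sequence[:i] + replacement + sequence[i + 1:]))
    return variants


def is_hit(variant_activity: float, native_activity: float, threshold: float = 0.10) -> bool:
    """True if the variant activity differs from native by at least `threshold` (relative)."""
    if native_activity == 0:
        raise ValueError("native activity must be non-zero")
    return abs(variant_activity - native_activity) / abs(native_activity) >= threshold
```

For a three-residue toy sequence "MAK", the scan yields variants at positions 1 and 3 only; a variant retaining 85% of native activity would be scored as a hit under the 10% cutoff.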
As used herein, "leads" are "hits" whose activity has been
optimized for the particular attribute, chemical, physical or biological
property. In the methods herein, leads are generally produced by
systematically replacing the hit loci with all remaining 18 amino acids, and
identifying those among the resulting proteins that have a desired activity.
The leads may be further optimized by replacement of a plurality of "hit"
residues. Leads may be generated by other methods known to those of
skill in the art and tested by the high-throughput methods herein. For
purposes herein a lead typically has activity with respect to the function
of interest that differs from the native activity by a desired amount, and
by at least 10%, 20%, 30% or more from the wild type or native
protein. Generally a lead will have an activity that is 2 to 10 or more
times that of the native protein for the activity of interest. As with hits, the
change in the activity is dependent upon the activity that is "evolved."
The desired alteration will depend upon the function or property of
interest.
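
As a hedged illustration of replacing hit loci with the remaining 18 amino acids, the following Python sketch enumerates candidate lead sequences. It assumes that the 18 "remaining" residues are the 20 standard amino acids minus the native residue and minus the residue already used in the scan; the function and parameter names are hypothetical.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def lead_candidates(sequence: str, hit_positions: list[int], scan_residue: str = "A"):
    """Enumerate single-substitution candidates at each hit position.

    For every 1-based hit position, the native residue and the scanning
    residue are excluded, leaving 18 replacements when the two differ.
    Returns a list of (position, new_residue, variant_sequence) tuples.
    """
    candidates = []
    for pos in hit_positions:
        native = sequence[pos - 1]
        for aa in AMINO_ACIDS:
            if aa in (native, scan_residue):
                continue
            variant = sequence[:pos - 1] + aa + sequence[pos:]
            candidates.append((pos, aa, variant))
    return candidates
```

Each candidate would then be assayed, and those with the desired activity retained as leads; combining substitutions at several hit positions is not covered by this sketch.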

As used herein, MOI is multiplicity of infection. 

As used herein, ip, with reference to a virus or recombinant vector,
refers to a titer of infectious particles.

As used herein, pp refers to the total number of vector (or virus)
physical particles.

As used herein, biological and pharmacological activity includes any
activity of a biological pharmaceutical agent and includes, but is not
limited to, biological efficiency, transduction efficiency, gene/transgene
expression, differential gene expression and induction activity, titer,
progeny productivity, toxicity, cytotoxicity, immunogenicity, cell
proliferation and/or differentiation activity, anti-viral activity,
morphogenetic activity, teratogenetic activity, pathogenetic activity,
therapeutic activity, tumor suppressor activity, ontogenetic activity,
oncogenetic activity, enzymatic activity, pharmacological activity,
cell/tissue tropism and delivery.

As used herein, "output signal" refers to parameters that can be
followed over time and, if desired, quantified. For example, when a virus
infects or is introduced into a cell, the cell containing the virus undergoes
a number of changes. Any such change that can be monitored and used
to assess infection is an output signal, and the cell is referred to as a
reporter cell; the encoding nucleic acid is referred to as a reporter gene,
and the construct that includes the encoding nucleic acid is a reporter
construct. Output signals include, but are not limited to, enzyme activity,
fluorescence, luminescence, amount of product produced and other such
signals. Output signals include expression of a viral gene or viral gene
product, including heterologous genes (transgenes) inserted into the virus.
Such expression is a function of time ("t") after infection, which in turn is
related to the amount of virus used to infect the cell, and, hence, the
concentration of virus ("s") in the infecting composition. For higher
concentrations the output signal is higher. For any particular
concentration, the output signal increases as a function of time until a
plateau is reached. Output signals may also measure the interaction
between cells, expressing heterologous genes, and biological agents.

As used herein, adeno-associated virus (AAV) is a defective and
non-pathogenic parvovirus that requires co-infection with either
adenovirus or herpes virus, which are able to provide helper functions,
for its growth and multiplication. A variety of serotypes are known, and
contemplated herein. Such serotypes include, but are not limited to:
AAV-1 (Genbank accession no. NC002077; accession no. VR-645); AAV-2
(Genbank accession no. NC001401; accession no. VR-680); AAV-3
(Genbank accession no. NC001729; accession no. VR-681); AAV-3b
(Genbank accession no. NC001863); AAV-4 (Genbank accession no.
NC001829; ATCC accession no. VR-646); AAV-6 (Genbank accession
no. NC001862); and avian associated adeno-virus (ATCC accession no.
VR-1449). The preparation and use of AAVs as vectors for gene
expression in vitro and for in vivo use for gene therapy are well known
(see, e.g., U.S. Patent Nos. 4,797,368, 5,139,941, 5,798,390 and
6,127,175; Tessier et al. (2001) J. Virol. 75:375-383; Salvetti et al.
(1998) Hum Gene Ther 20:695-706; Chadeuf et al. (2000) J Gene Med
2:260-268).

As used herein, the activity of a Rep protein or of a capsid protein
refers to any biological activity that can be assessed. In particular,
herein, the activity assessed for the Rep proteins is the amount (i.e., titer)
of AAV produced by a cell.

As used herein, the Hill equation is a mathematical model that
relates the concentration of a drug (i.e., test compound or substance) to
the response being measured:

$$y = \frac{y_{\max}\,[D]^{n}}{[D]^{n} + [D_{50}]^{n}}$$

where y is the variable being measured, such as a response or signal, y_max is
the maximal response achievable, [D] is the molar concentration of a
drug, [D_50] is the concentration that produces a 50% maximal response to
the drug, and n is the slope parameter, which is 1 if the drug binds to a single
site and there is no cooperativity between or among sites. A Hill plot is log10
of the ratio of ligand-occupied receptor to free receptor vs. log [D] (M).
The slope is n, where a slope of greater than 1 indicates cooperativity
among binding sites, and a slope of less than 1 can indicate heterogeneity
of binding. This general equation has been employed for assessing
interactions in complex biological systems (see, published International
PCT application No. WO 01/44809 based on PCT n° PCT/FR00/03503,
see, also, EXAMPLES).
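
The Hill equation and the Hill-plot relationship described above can be evaluated numerically as in the following minimal Python sketch. The function names and argument conventions are assumptions chosen for illustration; the sketch is not the Hill-based analysis of WO 01/44809.

```python
import math


def hill_response(d: float, y_max: float, d50: float, n: float = 1.0) -> float:
    """Response y = y_max * [D]^n / ([D]^n + [D50]^n)."""
    if d < 0 or d50 <= 0:
        raise ValueError("d must be >= 0 and d50 must be > 0")
    return y_max * d**n / (d**n + d50**n)


def hill_plot_point(d: float, y: float, y_max: float) -> tuple[float, float]:
    """Return (log10 [D], log10(y / (y_max - y))) for a Hill plot.

    On these axes the Hill equation is a straight line of slope n that
    crosses zero at [D] = [D50]. Requires 0 < y < y_max and d > 0.
    """
    return math.log10(d), math.log10(y / (y_max - y))
```

At [D] = [D_50] the response is half of y_max for any n, and two points taken from the same curve give a Hill-plot slope equal to n.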

As used herein, in the Hill-based analysis (published International
PCT application No. WO 01/44809 based on PCT n° PCT/FR00/03503),
the parameters π, κ, τ, ε, η and θ are as follows:

π is the potency of the biological agent acting on the assay (cell-based)
system;

κ is the constant of resistance of the assay system to elicit a
response to a biological agent;

ε is the global efficiency of the process or reaction triggered by the
biological agent on the assay system;

τ is the apparent titer of the biological agent;

θ is the absolute titer of the biological agent; and

η is the heterogeneity of the biological process or reaction.

In particular, as used herein, the parameters π (potency) or κ
(constant of resistance) are used to respectively assess the potency of a
test agent to produce a response in an assay system and the resistance
of the assay system to respond to the agent.

As used herein, ε (efficiency) is the slope at the inflection point of
the Hill curve (or, in general, of any other sigmoidal or linear
approximation), to assess the efficiency of the global reaction (the
biological agent and the assay system taken together) to elicit the
biological or pharmacological response.

As used herein, τ (apparent titer) is used to measure the limiting
dilution or the apparent titer of the biological agent.

As used herein, θ (absolute titer) is used to measure the absolute
limiting dilution or titer of the biological agent.

As used herein, η (heterogeneity) measures the existence of
discontinuous phases along the global reaction, which is reflected by an
abrupt change in the value of the Hill coefficient or in the constant of
resistance.

As used herein, a library of mutants refers to a collection of 
plasmids or other vehicles that carry (encode) the gene variants, such that 
individual plasmids or other vehicles carry individual gene variants.
When a library of proteins is contemplated, it will be so-stated. 

As used herein, a "reporter cell" is the cell that "reports", i.e., 
undergoes the change, in response to introduction of the nucleic acid 
infection and, therefore, it is named here a reporter cell. 
As used herein, "reporter" or "reporter moiety" refers to any moiety
that allows for the detection of a molecule of interest, such as a protein
expressed by a cell. Reporter moieties include, but are not limited to, for
example, fluorescent proteins, such as red, blue and green fluorescent
proteins; lacZ and other detectable proteins and gene products. For
expression in cells, nucleic acid encoding the reporter moiety can be
expressed as a fusion protein with a protein of interest or under the
control of a promoter of interest.

As used herein, a titering virus increases or decreases the output 
signal from a reporter virus, which is a virus that can be detected, such 
as by a detectable label or signal.

As used herein, phenotype refers to the physical, physiological or 
other manifestation of a genotype (a sequence of a gene). In methods 
herein, phenotypes that result from alteration of a genotype are assessed. 

As used herein, activity refers to the function or property to be
evolved. An active site refers to a site(s) responsible for, or that participates
in, conferring the activity or function. The activity or active site evolved
(the function or property and the site conferring or participating in
conferring the activity) may have nothing to do with natural activities of
a protein. For example, it could be an 'active site' for conferring
immunogenicity (immunogenic sites or epitopes) on a protein.

As used herein, the amino acids, which occur in the various amino 
acid sequences appearing herein, are identified according to their known, 
three-letter or one-letter abbreviations (see, Table 1). The nucleotides,
which occur in the various nucleic acid fragments, are designated with 
the standard single-letter designations used routinely in the art. 

As used herein, amino acid residue refers to an amino acid formed
upon chemical digestion (hydrolysis) of a polypeptide at its peptide
linkages. The amino acid residues described herein are presumed to be in
the "L" isomeric form. Residues in the "D" isomeric form, which are
so-designated, can be substituted for any L-amino acid residue, as long as
the desired functional property is retained by the polypeptide. NH2 refers
to the free amino group present at the amino terminus of a polypeptide.
COOH refers to the free carboxy group present at the carboxyl terminus
of a polypeptide. In keeping with standard polypeptide nomenclature
described in J. Biol. Chem., 243:3552-59 (1969) and adopted at 37
C.F.R. §§ 1.821-1.822, abbreviations for amino acid residues are
shown in the following Table:
Table 1

Table of Correspondence

SYMBOL
1-Letter    3-Letter    AMINO ACID
Y           Tyr         tyrosine
G           Gly         glycine
F           Phe         phenylalanine
M           Met         methionine
A           Ala         alanine
S           Ser         serine
I           Ile         isoleucine
L           Leu         leucine
T           Thr         threonine
V           Val         valine
P           Pro         proline
K           Lys         lysine
H           His         histidine
Q           Gln         glutamine
E           Glu         glutamic acid
Z           Glx         Glu and/or Gln
W           Trp         tryptophan
R           Arg         arginine
D           Asp         aspartic acid
N           Asn         asparagine
B           Asx         Asn and/or Asp
C           Cys         cysteine
X           Xaa         Unknown or other



It should be noted that all amino acid residue sequences
represented herein by formulae have a left to right orientation in the
conventional direction of amino-terminus to carboxyl-terminus. In
addition, the phrase "amino acid residue" is broadly defined to include the
amino acids listed in the Table of Correspondence and modified and
unusual amino acids, such as those referred to in 37 C.F.R. §§ 1.821-1.822,
and incorporated herein by reference. Furthermore, it should be
noted that a dash at the beginning or end of an amino acid residue
sequence indicates a peptide bond to a further sequence of one or more
amino acid residues or to an amino-terminal group such as NH2 or to a
carboxyl-terminal group such as COOH.

In a peptide or protein, suitable conservative substitutions of amino
acids are known to those of skill in this art and may be made generally
without altering the biological activity of the resulting molecule. Those of
skill in this art recognize that, in general, single amino acid substitutions
in non-essential regions of a polypeptide do not substantially alter
biological activity (see, e.g., Watson et al. Molecular Biology of the Gene,
4th Edition, 1987, The Benjamin/Cummings Pub. Co., p. 224).

Such substitutions are preferably made in accordance with those
set forth in TABLE 2 as follows:

TABLE 2

Original residue    Conservative substitution
Ala (A)             Gly; Ser
Arg (R)             Lys
Asn (N)             Gln; His
Cys (C)             Ser
Gln (Q)             Asn
Glu (E)             Asp
Gly (G)             Ala; Pro
His (H)             Asn; Gln
Ile (I)             Leu; Val
Leu (L)             Ile; Val
Lys (K)             Arg; Gln; Glu
Met (M)             Leu; Tyr; Ile
Phe (F)             Met; Leu; Tyr
Ser (S)             Thr
Thr (T)             Ser
Trp (W)             Tyr
Tyr (Y)             Trp; Phe
Val (V)             Ile; Leu

Other substitutions are also permissible and may be determined 
empirically or in accord with known conservative substitutions. 
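
For illustration, TABLE 2 can be encoded as a lookup and used to classify a proposed substitution, as in the minimal Python sketch below. The dictionary name and helper function are hypothetical; the entries are transcribed from TABLE 2 as listed, which is directional (for example, Glu to Asp is listed, whereas Asp has no entry).

```python
# One-letter encoding of TABLE 2: original residue -> listed conservative substitutions.
CONSERVATIVE = {
    "A": {"G", "S"}, "R": {"K"}, "N": {"Q", "H"}, "C": {"S"},
    "Q": {"N"}, "E": {"D"}, "G": {"A", "P"}, "H": {"N", "Q"},
    "I": {"L", "V"}, "L": {"I", "V"}, "K": {"R", "Q", "E"},
    "M": {"L", "Y", "I"}, "F": {"M", "L", "Y"}, "S": {"T"},
    "T": {"S"}, "W": {"Y"}, "Y": {"W", "F"}, "V": {"I", "L"},
}


def is_conservative(original: str, substitute: str) -> bool:
    """True if TABLE 2 lists `substitute` as conservative for `original`."""
    return substitute in CONSERVATIVE.get(original, set())
```

A substitution not listed in the table is simply reported as not listed; as stated above, other substitutions may still be permissible and can be determined empirically.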

As used herein, nucleic acids include DNA, RNA and analogs
thereof, including peptide nucleic acids (PNA) and mixtures thereof.
Nucleic acids can be single or double stranded. When referring to probes
or primers, optionally labeled with a detectable label, such as a
fluorescent or radiolabel, single-stranded molecules are contemplated.
Such molecules are typically of a length such that they are statistically
unique or of low copy number (typically less than 5, preferably less than 3)
for probing or priming a library. Generally a probe or primer contains at
least 14, 16 or 30 contiguous nucleotides of sequence complementary to or
identical to a gene of interest. Probes and primers can be 10, 14, 16, 20, 30, 50,
100 or more nucleic acid bases long.

As used herein, homologous means about greater than 25% nucleic
acid sequence identity, preferably 25%, 40%, 60%, 80%, 90% or 95%.
The intended percentage will be specified. The terms "homology" and
"identity" are often used interchangeably. In general, sequences are
aligned so that the highest order match is obtained (see, e.g.:
Computational Molecular Biology, Lesk, A.M., ed., Oxford University
Press, New York, 1988; Biocomputing: Informatics and Genome Projects,
Smith, D.W., ed., Academic Press, New York, 1993; Computer Analysis
of Sequence Data, Part I, Griffin, A.M., and Griffin, H.G., eds., Humana
Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von
Heinje, G., Academic Press, 1987; and Sequence Analysis Primer,
Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991;
Carillo et al. (1988) SIAM J Applied Math 48:1073). By sequence
identity, the number of conserved amino acids is determined by standard
alignment algorithm programs, used with default gap penalties
established by each supplier. Substantially homologous nucleic acid
molecules would hybridize typically at moderate stringency or at high
stringency all along the length of the nucleic acid of interest. Also
contemplated are nucleic acid molecules that contain degenerate codons
in place of codons in the hybridizing nucleic acid molecule.
As used herein, a nucleic acid homolog refers to a nucleic acid that
includes a preselected conserved nucleotide sequence, such as a
sequence encoding a therapeutic polypeptide. By the term "substantially
homologous" it is meant having at least 80%, preferably at least 90%,
most preferably at least 95% homology therewith or a lesser percentage of
homology or identity and conserved biological activity or function.

The terms "homology" and "identity" are often used
interchangeably. In this regard, percent homology or identity may be
determined, for example, by comparing sequence information using a GAP
computer program. The GAP program uses the alignment method of
Needleman and Wunsch (J. Mol. Biol. 48:443 (1970)), as revised by Smith
and Waterman (Adv. Appl. Math. 2:482 (1981)). Briefly, the GAP program
defines similarity as the number of aligned symbols (i.e., nucleotides or
amino acids) which are similar, divided by the total number of symbols in
the shorter of the two sequences. The preferred default parameters for
the GAP program may include: (1) a unary comparison matrix (containing
a value of 1 for identities and 0 for non-identities) and the weighted
comparison matrix of Gribskov and Burgess, Nucl. Acids Res. 14:6745
(1986), as described by Schwartz and Dayhoff, eds., ATLAS OF PROTEIN
SEQUENCE AND STRUCTURE, National Biomedical Research Foundation,
pp. 353-358 (1979); (2) a penalty of 3.0 for each gap and an additional
0.10 penalty for each symbol in each gap; and (3) no penalty for
end gaps.
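
The similarity measure described above (matching aligned symbols divided by the length of the shorter sequence) can be illustrated with the following minimal Python sketch. It assumes the two sequences have already been aligned, with "-" marking gaps, and it applies only the unary comparison matrix (1 for identities, 0 otherwise); it does not perform the alignment itself and is not the GAP program. The function name is hypothetical.

```python
def gap_similarity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Matching aligned symbols divided by the ungapped length of the shorter sequence."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Unary matrix: score 1 for an identity between two non-gap symbols, else 0.
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap)
    shorter = min(len(aligned_a.replace(gap, "")), len(aligned_b.replace(gap, "")))
    if shorter == 0:
        raise ValueError("sequences must contain at least one symbol")
    return matches / shorter
```

For the toy alignment of "ACGT-A" against "ACCTTA", four positions match and the shorter sequence has five symbols, giving a similarity of 0.8.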

Whether any two nucleic acid molecules have nucleotide sequences
that are, for example, at least 80%, 85%, 90%, 95%, 96%, 97%, 98%
or 99% "identical" can be determined using known computer algorithms
such as the "FASTA" program, using for example, the default parameters
as in Pearson and Lipman, Proc. Natl. Acad. Sci. USA 85:2444 (1988).
Alternatively the BLAST function of the National Center for Biotechnology
Information database may be used to determine identity.

In general, sequences are aligned so that the highest order match
is obtained. "Identity" per se has an art-recognized meaning and can be
calculated using published techniques. (See, e.g.: Computational
Molecular Biology, Lesk, A.M., ed., Oxford University Press, New York,
1988; Biocomputing: Informatics and Genome Projects, Smith, D.W., ed.,
Academic Press, New York, 1993; Computer Analysis of Sequence Data,
Part I, Griffin, A.M., and Griffin, H.G., eds., Humana Press, New Jersey,
1994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic
Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux,
J., eds., M Stockton Press, New York, 1991). While there exist a number
of methods to measure identity between two polynucleotide or
polypeptide sequences, the term "identity" is well known to skilled
artisans (Carillo, H. & Lipton, D., SIAM J Applied Math 48:1073 (1988)).
Methods commonly employed to determine identity or similarity between
two sequences include, but are not limited to, those disclosed in Guide to
Huge Computers, Martin J. Bishop, ed., Academic Press, San Diego,
1994, and Carillo, H. & Lipton, D., SIAM J Applied Math 48:1073
(1988). Methods to determine identity and similarity are codified in
computer programs. Preferred computer program methods to determine
identity and similarity between two sequences include, but are not limited
to, GCG program package (Devereux, J., et al., Nucleic Acids Research
12(1):387 (1984)), BLASTP, BLASTN, FASTA (Altschul, S.F., et al., J
Molec Biol 215:403 (1990)), and CLUSTALW. For sequences displaying
a relatively high degree of homology, alignment can be effected manually
by simply lining up the sequences by eye and matching the conserved
portions.
Therefore, as used herein, the term "identity" represents a
comparison between a test and a reference polypeptide or polynucleotide.
For example, a test polypeptide may be defined as any polypeptide that
is 90% or more identical to a reference polypeptide.

For the alignments presented herein (see, Figures 3A and 3B) for
the AAV serotypes, the CLUSTALW program was employed with
parameters set as follows: scoring matrix BLOSUM, gap open 10, gap
extend 0.1, gap distance 40% and transitions/transversions 0.5; specific
residue penalties for hydrophilic amino acids (DEGKNPQRS), distance
between gaps for which the penalties are augmented was 8, and gaps at the
extremities penalized less than internal gaps.

As used herein, a "corresponding" position on a protein, such as
the AAV Rep protein, refers to an amino acid position based upon
alignment to maximize sequence identity. For AAV Rep proteins an
alignment of the Rep 78 protein from AAV-2 and the corresponding
protein from other AAV serotypes (AAV-1, AAV-6, AAV-3, AAV-3B,
AAV-4, AAV-2 and AAV-5) is shown in Figures 3A and 3B. The "hit"
positions are shown in italics.

As used herein, the term at least "90% identical to" refers to
percent identities from 90 to 100% relative to the reference polypeptides.
Identity at a level of 90% or more is indicative of the fact that, assuming
for exemplification purposes that a test and reference polypeptide length of
100 amino acids are compared, no more than 10% (i.e., 10 out of 100) of the
amino acids in the test polypeptide differ from those of the reference
polypeptides. Similar comparisons may be made between test and
reference polynucleotides. Such differences may be represented as point
mutations randomly distributed over the entire length of an amino acid
sequence or they may be clustered in one or more locations of varying
length up to the maximum allowable, e.g. 10/100 amino acid difference
(approximately 90% identity). Differences are defined as nucleic acid or
amino acid substitutions, or deletions.
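
The "at least 90% identical" criterion can be illustrated with a short Python sketch that counts position-by-position differences between two equal-length, pre-aligned sequences. The function names are hypothetical, and integer percentages are used so that the 10-in-100 boundary case is not disturbed by floating-point rounding; deletions are assumed to be represented by a gap character that simply mismatches.

```python
def max_differences(length: int, min_identity_pct: int) -> int:
    """Largest number of differing positions allowed at the given percent identity."""
    return length * (100 - min_identity_pct) // 100


def meets_identity(test: str, reference: str, min_identity_pct: int = 90) -> bool:
    """True if `test` differs from `reference` at no more than the allowed positions."""
    if len(test) != len(reference):
        raise ValueError("sequences must be aligned to equal length")
    differences = sum(1 for a, b in zip(test, reference) if a != b)
    # Integer comparison: differences / length <= (100 - pct) / 100
    return differences * 100 <= len(reference) * (100 - min_identity_pct)
```

For a 100-residue reference, ten differences satisfy the 90% criterion and eleven do not, whether the differences are scattered or clustered.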

As used herein, it is also understood that the terms substantially
identical or similar vary with the context as understood by those skilled
in the relevant art.

As used herein, genetic therapy involves the transfer of
heterologous nucleic acids to certain cells, target cells, of a mammal,
particularly a human, with a disorder or condition for which such therapy
is sought. The nucleic acid, such as DNA, is introduced into the selected
target cells in a manner such that the heterologous nucleic acid, such as
DNA, is expressed and a therapeutic product encoded thereby is
produced. Alternatively, the heterologous nucleic acid, such as DNA,
may in some manner mediate expression of DNA that encodes the
therapeutic product, or it may encode a product, such as a peptide or
RNA that in some manner mediates, directly or indirectly, expression of a
therapeutic product. Genetic therapy may also be used to deliver nucleic
acid encoding a gene product that replaces a defective gene or
supplements a gene product produced by the mammal or the cell in which
it is introduced. The introduced nucleic acid may encode a therapeutic
compound, such as a growth factor or inhibitor thereof, or a tumor
necrosis factor or inhibitor thereof, such as a receptor therefor, that is not
normally produced in the mammalian host or that is not produced in
therapeutically effective amounts or at a therapeutically useful time. The
heterologous nucleic acid, such as DNA, encoding the therapeutic product
may be modified prior to introduction into the cells of the afflicted host in
order to enhance or otherwise alter the product or expression thereof.
Genetic therapy may also involve delivery of an inhibitor or repressor or
other modulator of gene expression.
As used herein, heterologous or foreign nucleic acid, such as DNA
and RNA, are used interchangeably and refer to DNA or RNA that does
not occur naturally as part of the genome in which it is present or which
is found in a location or locations in the genome that differ from that in
which it occurs in nature. Heterologous nucleic acid is generally not
endogenous to the cell into which it is introduced, but has been obtained
from another cell or prepared synthetically. Generally, although not
necessarily, such nucleic acid encodes RNA and proteins that are not
normally produced by the cell in which it is expressed. Any DNA or RNA
that one of skill in the art would recognize or consider as heterologous or
foreign to the cell in which it is expressed is herein encompassed by
heterologous DNA. Heterologous DNA and RNA may also encode RNA or
proteins that mediate or alter expression of endogenous DNA by affecting
transcription, translation, or other regulatable biochemical processes.
Examples of heterologous nucleic acid include, but are not limited to,
nucleic acid that encodes traceable marker proteins, such as a protein
that confers drug resistance, nucleic acid that encodes therapeutically
effective substances, such as anti-cancer agents, enzymes and hormones,
and DNA that encodes other types of proteins, such as antibodies.

Hence, herein heterologous DNA or foreign DNA includes a DNA
molecule not present in the exact orientation and position as the
counterpart DNA molecule found in the genome. It may also refer to a
DNA molecule from another organism or species (i.e., exogenous).

As used herein, a therapeutically effective product introduced by
genetic therapy is a product that is encoded by heterologous nucleic acid,
typically DNA, such that, upon introduction of the nucleic acid into a host, a
product is expressed that ameliorates or eliminates the symptoms or
manifestations of an inherited or acquired disease or that cures the
disease.
As used herein, a therapeutically effective dose refers to that
amount of the compound sufficient to result in amelioration of symptoms 
of disease. 

As used herein, isolated with reference to a nucleic acid molecule
or polypeptide or other biomolecule means that the nucleic acid or
polypeptide has been separated from the genetic environment from which the
polypeptide or nucleic acid was obtained. It may also mean altered from
the natural state. For example, a polynucleotide or a polypeptide naturally
present in a living animal is not "isolated," but the same polynucleotide or
polypeptide separated from the coexisting materials of its natural state is
"isolated", as the term is employed herein. Thus, a polypeptide or
polynucleotide produced and/or contained within a recombinant host cell
is considered isolated. Also intended as an "isolated polypeptide" or an
"isolated polynucleotide" are polypeptides or polynucleotides that have
been purified, partially or substantially, from a recombinant host cell or
from a native source. For example, a recombinantly produced version of
a compound can be substantially purified by the one-step method
described in Smith and Johnson, Gene 67:31-40 (1988). The terms
isolated and purified are sometimes used interchangeably.

Thus, by "isolated" it is meant that the nucleic acid is free of the coding
sequences of those genes that, in the naturally-occurring genome of the
organism (if any), immediately flank the gene encoding the nucleic acid of
interest. Isolated DNA may be single-stranded or double-stranded, and
may be genomic DNA, cDNA, recombinant hybrid DNA, or synthetic
DNA. It may be identical to a native DNA sequence, or may differ from
such sequence by the deletion, addition, or substitution of one or more
nucleotides.

Isolated or purified as it refers to preparations made from biological
cells or hosts means any cell extract containing the indicated DNA or
protein, including a crude extract of the DNA or protein of interest. For
example, in the case of a protein, a purified preparation can be obtained
following an individual technique or a series of preparative or biochemical
techniques, and the DNA or protein of interest can be present at various
degrees of purity in these preparations. The procedures may include, for
example, but are not limited to, ammonium sulfate fractionation, gel
filtration, ion exchange chromatography, affinity chromatography,
density gradient centrifugation and electrophoresis.

A preparation of DNA or protein that is "substantially pure" or
"isolated" should be understood to mean a preparation free from naturally
occurring materials with which such DNA or protein is normally
associated in nature. "Essentially pure" should be understood to mean a
"highly" purified preparation that contains at least 95% of the DNA or
protein of interest.

A cell extract that contains the DNA or protein of interest should be
understood to mean a homogenate preparation or cell-free preparation
obtained from cells that express the protein or contain the DNA of 
interest. The term "cell extract" is intended to include culture media, 
especially spent culture media from which the cells have been removed. 

As used herein, receptor refers to a biologically active molecule
that specifically binds to (or with) other molecules. The term "receptor
protein" may be used to more specifically indicate the proteinaceous 
nature of a specific receptor. 

As used herein, recombinant refers to any progeny formed as the
result of genetic engineering.

As used herein, a promoter region refers to the portion of DNA of a
gene that controls transcription of the DNA to which it is operatively
linked. The promoter region includes specific sequences of DNA that are
sufficient for RNA polymerase recognition, binding and transcription
initiation. This portion of the promoter region is referred to as the
promoter. In addition, the promoter region includes sequences that
modulate this recognition, binding and transcription initiation activity of
the RNA polymerase. These sequences may be cis acting or may be
responsive to trans acting factors. Promoters, depending upon the nature
of the regulation, may be constitutive or regulated.

As used herein, the phrase "operatively linked" generally means the
sequences or segments have been covalently joined into one piece of
DNA, whether in single or double stranded form, whereby control or
regulatory sequences on one segment control or permit expression or
replication or other such control of other segments. The two segments
are not necessarily contiguous. For gene expression, a DNA sequence and
a regulatory sequence(s) are connected in such a way as to control or permit
gene expression when the appropriate molecules, e.g., transcriptional
activator proteins, are bound to the regulatory sequence(s).

As used herein, production by recombinant means, i.e., by using
recombinant DNA methods, means the use of the well known methods of
molecular biology for expressing proteins encoded by cloned DNA,
including cloning and expression of genes, and methods such as gene
shuffling and phage display with screening for desired specificities.

As used herein, a splice variant refers to a variant produced by 
differential processing of a primary transcript of genomic DNA that results 
in more than one type of mRNA. 

As used herein, a composition refers to any mixture of two or more
products or compounds. It may be a solution, a suspension, liquid,
powder, a paste, aqueous, non-aqueous or any combination thereof.

As used herein, a combination refers to any association between
two or more items.

As used herein, substantially identical to a product means
sufficiently similar so that the property of interest is sufficiently
unchanged so that the substantially identical product can be used in place
of the product.

As used herein, the term "vector" refers to a nucleic acid molecule
capable of transporting another nucleic acid to which it has been linked.
One type of preferred vector is an episome, i.e., a nucleic acid capable of
extra-chromosomal replication. Preferred vectors are those capable of
autonomous replication and/or expression of nucleic acids to which they
are linked. Vectors capable of directing the expression of genes to which
they are operatively linked are referred to herein as "expression vectors".
In general, expression vectors of utility in recombinant DNA techniques
are often in the form of "plasmids", which refer generally to circular
double stranded DNA loops which, in their vector form, are not bound to
the chromosome. "Plasmid" and "vector" are used interchangeably as the
plasmid is the most commonly used form of vector. Other forms of
expression vectors that serve equivalent functions and that become
known in the art subsequently hereto can also be used.

As used herein, vector is also used interchangeably with "virus
vector" or "viral vector". In this case, which will be clear from the
context, the "vector" is not self-replicating. Viral vectors are engineered
viruses that are operatively linked to exogenous genes to transfer (as
vehicles or shuttles) the exogenous genes into cells.

As used herein, transduction refers to the process of gene transfer
and expression into mammalian and other cells mediated by viruses.
Transfection refers to the process when mediated by plasmids.

As used herein, "polymorphism" refers to the coexistence of more
than one form of a gene or portion thereof. A portion of a gene of which
there are at least two different forms, i.e., two different nucleotide
sequences, is referred to as a "polymorphic region of a gene". A
polymorphic region can be a single nucleotide, referred to as a single
nucleotide polymorphism (SNP), the identity of which differs in different
alleles. A polymorphic region can also be several nucleotides in length.

As used herein, "polymorphic gene" refers to a gene having at least
one polymorphic region.

As used herein, "allele", which is used interchangeably herein with
"allelic variant", refers to alternative forms of a gene or portions thereof.
Alleles occupy the same locus or position on homologous chromosomes.
When a subject has two identical alleles of a gene, the subject is said to
be homozygous for the gene or allele. When a subject has two different
alleles of a gene, the subject is said to be heterozygous for the gene.
Alleles of a specific gene can differ from each other in a single nucleotide,
or several nucleotides, and can include substitutions, deletions, and
insertions of nucleotides. An allele of a gene can also be a form of a gene
containing a mutation.

As used herein, the term "gene" or "recombinant gene" refers to a
nucleic acid molecule comprising an open reading frame and including at
least one exon and (optionally) an intron sequence. A gene can be either
RNA or DNA. Genes may include regions preceding and following the
coding region (leader and trailer).

As used herein, "intron" refers to a DNA sequence present in a
given gene which is spliced out during mRNA maturation.

As used herein, "nucleotide sequence complementary to the 
nucleotide sequence set forth in SEQ ID NO: x" refers to the nucleotide 
sequence of the complementary strand of a nucleic acid strand having 



WO 03/018820 



PCT/IB02/04087 



SEQ ID NO: x. The term "complementary strand" is used herein
interchangeably with the term "complement". The complement of a
nucleic acid strand can be the complement of a coding strand or the
complement of a non-coding strand. When referring to double stranded
nucleic acids, the complement of a nucleic acid having SEQ ID NO: x
refers to the complementary strand of the strand having SEQ ID NO: x or
to any nucleic acid having the nucleotide sequence of the complementary
strand of SEQ ID NO: x. When referring to a single stranded nucleic acid
having the nucleotide sequence SEQ ID NO: x, the complement of this
nucleic acid is a nucleic acid having a nucleotide sequence which is
complementary to that of SEQ ID NO: x.
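
The relationship between a strand and its complement can be illustrated with a short sketch. It assumes a DNA sequence written 5' to 3' that contains only the letters A, C, G and T; the function name `complement_strand` is hypothetical and is not part of the methods described herein.

```python
# Watson-Crick base pairing for DNA (assumption: unambiguous bases only)
BASE_PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement_strand(sequence):
    """Return the complementary strand of `sequence`, written 5' to 3'."""
    # The complementary strand runs antiparallel, so the input is read in
    # reverse while each base is replaced by its pairing partner.
    return "".join(BASE_PAIRS[base] for base in reversed(sequence.upper()))


if __name__ == "__main__":
    print(complement_strand("ATGGCC"))  # prints GGCCAT
```

Applying the function twice returns the starting sequence, which matches the statement that either strand of a double stranded nucleic acid is the complement of the other.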

As used herein, the term "coding sequence" refers to that portion 
of a gene that encodes an amino acid sequence of a protein. 

As used herein, the term "sense strand" refers to that strand of a
double-stranded nucleic acid molecule that has the sequence of the
mRNA that encodes the amino acid sequence encoded by the double-
stranded nucleic acid molecule.

As used herein, the term "antisense strand" refers to that strand of
a double-stranded nucleic acid molecule that is the complement of the
sequence of the mRNA that encodes the amino acid sequence encoded
by the double-stranded nucleic acid molecule.

As used herein, an array refers to a collection of elements, such as
nucleic acid molecules, containing three or more members. An
addressable array is one in which the members of the array are
identifiable, typically by position on a solid phase support or by virtue of
an identifiable or detectable label, such as by color, fluorescence,
electronic signal (i.e., RF, microwave or other frequency that does not
substantially alter the interaction of the molecules of interest), bar code or
other symbology, chemical or other such label. Hence, in general the
members of the array are immobilized to discrete identifiable loci on the
surface of a solid phase or directly or indirectly linked to or otherwise
associated with the identifiable label, such as affixed to a microsphere or
other particulate support (herein referred to as beads) and suspended in
solution or spread out on a surface.

As used herein, a support (also referred to as a matrix support, a
matrix, an insoluble support or solid support) refers to any solid or
semisolid or insoluble support to which a molecule of interest, typically a
biological molecule, organic molecule or biospecific ligand, is linked or
contacted. Such materials include any materials that are used as affinity
matrices or supports for chemical and biological molecule syntheses and
analyses, such as, but not limited to: polystyrene, polycarbonate,
polypropylene, nylon, glass, dextran, chitin, sand, pumice, agarose,
polysaccharides, dendrimers, buckyballs, polyacrylamide, silicon, rubber,
and other materials used as supports for solid phase syntheses, affinity
separations and purifications, hybridization reactions, immunoassays and
other such applications. The matrix herein can be particulate or can be
in the form of a continuous surface, such as a microtiter dish or well, a
glass slide, a silicon chip, a nitrocellulose sheet, nylon mesh, or other
such materials. When particulate, typically the particles have at least one
dimension in the 5-10 mm range or smaller. Such particles, referred to
collectively herein as "beads", are often, but not necessarily, spherical.
Such reference, however, does not constrain the geometry of the matrix,
which may be any shape, including random shapes, needles, fibers, and
elongated shapes. Roughly spherical "beads", particularly microspheres that
can be used in the liquid phase, are also contemplated. The "beads" may
include additional components, such as magnetic or paramagnetic
particles (see, e.g., Dynabeads (Dynal, Oslo, Norway)) for separation
using magnets, as long as the additional components do not interfere with
the methods and analyses herein.

As used herein, matrix or support particles refers to matrix
materials that are in the form of discrete particles. The particles have any
shape and dimensions, but typically have at least one dimension that is
100 mm or less, 50 mm or less, 10 mm or less, 1 mm or less, 100 µm or
less, 50 µm or less and typically have a size that is 100 mm³ or less, 50
mm³ or less, 10 mm³ or less, and 1 mm³ or less, 100 µm³ or less and may
be on the order of cubic microns. Such particles are collectively called "beads."

As used herein, the abbreviations for any protective groups, amino
acids and other compounds, are, unless indicated otherwise, in accord
with their common usage, recognized abbreviations, or the IUPAC-IUB
Commission on Biochemical Nomenclature (see, (1972) Biochem.
11:942-944).

B. DIRECTED EVOLUTION OF A VIRAL GENE

Recombinant viruses have been developed for use as gene therapy
vectors. Gene therapy applications are hampered by the need for
development of vectors with traits optimized for this application. The
high throughput methods provided herein are ideally suited for
development of such vectors. In addition to use for development of
recombinant viral vectors for gene therapy, these methods can also be
used to study and modify the viral vector backbone architecture, trans-
complementing helper functions, where appropriate, regulatable and
tissue specific promoters and transgene and genomic sequence analyses.
Recombinant AAV (rAAV) is a gene therapy vector that can serve these
and other purposes.

The rep protein is an adeno-associated virus protein involved in a
number of biological processes necessary to AAV replication. The
production of the Rep proteins enables viral DNA to replicate,
encapsulate and integrate (McCarty et al. (1992) J. Virol. 66:4050-4057;
Horer et al. (1995) J. Virol. 69:5485-5496; Berns et al. (1996) Biology of
Adeno-associated virus, in Adeno-associated virus (AAV) Vectors in Gene
Therapy, K.I. Berns and C. Giraud, Springer (1996); and Chiorini et al.
(1996) The Roles of AAV Rep Proteins in gene Expression and Targeted
Integration, from Adeno-associated virus (AAV) Vectors in Gene Therapy,
K.I. Berns and C. Giraud, Springer (1996)). A rep protein with improved
activity could lead to increased amounts of virus progeny, thus allowing
higher productivity of rAAV vectors.

Since the Rep protein is involved in replication it can serve as a
target for increasing viral production. Since it has a variety of functions
and its role in replication is complex, it has heretofore been difficult to
identify mutations that result in increased viral production. The methods
herein, which rely on in vivo screening methods, permit optimization of its
activities as assessed by increases in viral production. Provided herein
are Rep proteins and viruses and viral vectors containing the mutated Rep
proteins that provide such increase. The amino acid positions on the rep
proteins that are relevant for rep protein activities in terms of AAV or
rAAV virus production are provided. Those amino acid positions are such
that a change in the amino acid leads to a change in protein activity, either
to lower activity or to increased activity. As shown herein, the alanine or
amino acid scan revealed the amino acid positions important for such
activity (i.e., hits). Subsequent mutations produced by systematically
replacing the amino acids at the hit positions with the remaining 18 amino
acids produced so-called "leads" that have amino acid changes and result
in higher virus production. In this particular example, the method used
included the following specific steps.

Amino acid scan

In order to first identify those amino acid (aa) positions on the rep
protein that are involved in rep protein activity, an Ala-scan was
performed on the rep sequence. For this, each aa in the rep protein
sequence was individually changed to alanine. Any other amino acid,
particularly another amino acid such as Gly or Ser that has a neutral
effect on structure, could have been used. Each resulting mutant rep
protein was then expressed and the amount of virus it produced was
measured. The relative activity of each individual mutant compared to
the native protein is indicated in FIG. 2A. HITS are those mutants that
produce a decrease in the activity of the protein (in the example: all the
mutants with activities below about 20% of the native activity).
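
As an illustration only, the scan and the hit criterion described above can be sketched as follows. The toy sequence, the activity values and the function names are hypothetical; skipping positions that are already alanine is an assumption, and the 20% cut-off is the approximate value given for the example.

```python
def ala_scan_variants(protein):
    """Return one (position, native_residue, mutant_sequence) tuple per scanned residue."""
    variants = []
    for index, native in enumerate(protein):
        if native == "A":
            continue  # assumption: positions that are already alanine are skipped
        mutant = protein[:index] + "A" + protein[index + 1:]
        variants.append((index + 1, native, mutant))  # positions are 1-based
    return variants


def select_hits(relative_activity, threshold=0.20):
    """Return positions whose mutant activity is below `threshold` of native activity."""
    return sorted(pos for pos, activity in relative_activity.items()
                  if activity < threshold)


if __name__ == "__main__":
    toy_protein = "MPGA"  # hypothetical four-residue sequence, not the Rep sequence
    for position, native, mutant in ala_scan_variants(toy_protein):
        print(position, native, mutant)
    # invented relative activities (mutant titer / native titer), illustration only
    toy_activity = {1: 0.05, 2: 0.90, 3: 0.15}
    print(select_hits(toy_activity))
```

Each variant carries exactly one substitution, which mirrors the one-mutant-per-construct design of the scan; the measured activities would come from the titer assay rather than from the code.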

In a second experimental round, which included a new set of
mutations and phenotypic analysis, each amino acid position hit by the
Ala-scan step was mutated by replacing the native amino acid with each
of the remaining 18 amino acids, using site-directed
mutagenesis.

In both rounds, each mutant was individually designed, generated
and processed separately, and optionally in parallel with the other
mutants. Neither combinatorial generation of mutants nor mixtures
thereof were used in any step of the method.

A plasmid library was thus generated in which each plasmid contained
a different mutant bearing a different amino acid at a different hit
position. Again, each resulting mutant rep protein was then expressed
and the amount of virus it could produce measured as indicated below.
The relative activity of each individual mutant compared to the native
protein is indicated in FIGURE 2B. LEADS are those mutants that lead to
an increase in the activity of the protein (in the example: the ten mutants
with activities higher, typically between 2 and 10 times or more, generally
6-10 times, than the native activity).
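
The second round can be sketched in the same spirit. The code below enumerates single substitutions at hit positions, excluding the native residue and alanine, and then filters for leads. All names and values are hypothetical, and the default two-fold cut-off is an assumption taken from the lower end of the range quoted above.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def replacement_variants(protein, hit_positions):
    """List (position, native, substitute) for each hit position.

    The native residue and alanine (already tested in the scan) are excluded,
    leaving 18 substitutions per position when the native residue is not Ala.
    """
    variants = []
    for position in hit_positions:
        native = protein[position - 1]  # positions are 1-based
        for substitute in AMINO_ACIDS:
            if substitute not in (native, "A"):
                variants.append((position, native, substitute))
    return variants


def select_leads(fold_activity, min_fold=2.0):
    """Return variants whose activity is at least `min_fold` times native activity."""
    return sorted(variant for variant, fold in fold_activity.items()
                  if fold >= min_fold)


if __name__ == "__main__":
    toy_protein = "MPGA"  # hypothetical sequence, not the Rep sequence
    library = replacement_variants(toy_protein, [1, 3])
    print(len(library))  # 2 hit positions x 18 substitutions = 36
    # invented fold-activities relative to native, illustration only
    print(select_leads({(1, "M", "L"): 6.5, (3, "G", "S"): 0.4}))
```

Because every entry is a single, individually specified substitution, the list corresponds to a library built one construct at a time rather than to a combinatorial pool.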

Expression of the genetic variants and phenotypic characterization.

The rep protein acts as an intracellular protein through complex
interaction with a molecular network composed of cellular proteins, DNA,
AAV proteins and adenoviral proteins (note: some adenovirus proteins
have to be present for the rep protein to work). The final outcome of the
rep protein activity is the virus offspring composed of infectious rAAV
particles. It can be expected that the activity of rep mutants would affect
the titer of the rAAV virus coming out of the cells.

As the phenotypic characterization of the rep variants can only be
accomplished by assaying their activity from inside mammalian cells, a
mammalian cell-based expression system as well as a mammalian cell-
based assay was used. The individual rep protein variants were expressed
in human 293 HEK cells, by transfection of the individual plasmids
constituting the diverse plasmid library. All necessary functions were
provided as follows:

(a) the cellular proteins present in the permissive specific 293 HEK
cells;

(b) the AAV necessary proteins and DNA were provided by co-
transfection of the AAV cap gene as well as a rAAV plasmid vector
providing the necessary signaling and substrate ITR sequences;

(c) the adenovirus (AV) proteins were provided by co-transfection
with a plasmid expressing all the AV helper functions.

A library of recombinant viruses with mutant rep encoding genes
was generated. Each recombinant, upon introduction into a mammalian
cell and expression, resulted in production of rAAV infectious particles.
The number of infectious particles produced by each recombinant was
determined in order to assess the activity of the rep variant that had
generated that amount of infectious particles.

The number of infectious particles produced was determined in a
cell-based assay in which the activity of a reporter gene (in the
exemplified embodiment, the bacterial lacZ gene) or virus replication
(real-time PCR) was measured to quantitatively assess the number of viruses.
The limiting dilution (titer) for each virus preparation (each coming from a
different rep variant) was determined by serial dilution of the viruses
produced, followed by infection of appropriate cells (293 HEK or HeLa
rep/cap 32 cells) with each dilution for each virus and then by
measurement of the activity of the reporter gene for each dilution of each
virus. Hill plots (NAUTSCAN™) (published as International PCT application
No. WO 01/44809 based on PCT No. PCT/FR00/03503, Dec, 2000; see
EXAMPLES) or a second order polynomial function (Drittanti et al. (2000)
Gene Ther. 7:924-929; see co-pending U.S. provisional application Serial
No. 60/315,382) was used to analyze the readout data and to calculate the
virus titers. Briefly, the titer was calculated from the second order
polynomial function by non-linear regression fitting of the experimental
data. The point where the polynomial curve reaches its minimum is
considered to be the titer of the rAAV preparation. Results are shown in
the EXAMPLE below.
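
The curve-fitting step can be pictured with a minimal sketch. It fits y = a·x² + b·x + c to paired data and reports the x value at which the fitted curve is lowest. Which quantities lie on each axis is not stated in this passage, so the data below are placeholders. The text describes the fit as non-linear regression; the sketch instead uses ordinary least squares solved by Cramer's rule, which gives the least-squares solution for a polynomial because the model is linear in its coefficients. All function names are hypothetical.

```python
def det3(m):
    """Determinant of a 3x3 matrix given as a list of three rows."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def fit_quadratic(xs, ys):
    """Least-squares coefficients (a, b, c) of y = a*x**2 + b*x + c."""
    s = [sum(x ** k for x in xs) for k in range(5)]                 # sums of x^k
    t = [sum((x ** k) * y for x, y in zip(xs, ys)) for k in range(3)]  # sums of x^k * y
    matrix = [[s[4], s[3], s[2]],
              [s[3], s[2], s[1]],
              [s[2], s[1], s[0]]]
    rhs = [t[2], t[1], t[0]]
    d = det3(matrix)  # zero if fewer than three distinct x values are supplied
    coeffs = []
    for col in range(3):
        replaced = [row[:] for row in matrix]
        for r in range(3):
            replaced[r][col] = rhs[r]  # Cramer's rule: swap in the right-hand side
        coeffs.append(det3(replaced) / d)
    return tuple(coeffs)


def curve_minimum(xs, ys):
    """x position of the minimum of the fitted parabola."""
    a, b, _ = fit_quadratic(xs, ys)
    if a <= 0:
        raise ValueError("fitted parabola opens downward; it has no minimum")
    return -b / (2 * a)


if __name__ == "__main__":
    # placeholder data lying exactly on y = 2*(x - 3)**2 + 1
    xs = [0, 1, 2, 3, 4, 5, 6]
    ys = [2 * (x - 3) ** 2 + 1 for x in xs]
    print(curve_minimum(xs, ys))
```

With real readout data the fit would not be exact, and at least three distinct x values are needed for the system to be solvable.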

Comparison between results of full-length Hit position analysis
reported here and the literature

The experiments identified a number of heretofore unknown
mutation loci, which include the hits at positions: 4, 20, 22, 28, 32, 38,
39, 54, 59, 124, 125, 127, 132, 140, 161, 163, 193, 196, 197, 221,
228, 231, 234, 258, 260, 263, 264, 334, 335, 341, 342, 347, 350,
354, 363, 364, 367, 370, 376, 381, 389, 407, 411, 414, 420, 421,
422, 428, 429, 438, 440, 451, 460, 462, 484, 488, 495, 497, 498,
499, 503, 511, 512, 516, 517 and 518 with reference to the amino
acids in Rep 78 and Rep 68. Rep 78 is encoded by nucleotides 321-2186;
Rep 68 is encoded by nucleotides 321-1906 and 2228-2252; Rep
52 is encoded by nucleotides 993-2186; and Rep 40 is encoded by
nucleotides 993-1906 and 2228-2252 of wildtype AAV.
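
The coordinates above can be checked for internal consistency with a small sketch. It reads all four sets of coordinates as inclusive nucleotide positions and adds up the segment lengths; the dictionary and function names are hypothetical, and whether the spans include a stop codon is not stated here.

```python
# Inclusive nucleotide segments as listed in the text for each Rep protein
REP_SEGMENTS = {
    "Rep78": [(321, 2186)],
    "Rep68": [(321, 1906), (2228, 2252)],
    "Rep52": [(993, 2186)],
    "Rep40": [(993, 1906), (2228, 2252)],
}


def coding_length(segments):
    """Total number of nucleotides covered by inclusive (start, end) segments."""
    return sum(end - start + 1 for start, end in segments)


if __name__ == "__main__":
    for name, segments in REP_SEGMENTS.items():
        nucleotides = coding_length(segments)
        # a remainder of 0 means the span is a whole number of codons
        print(name, nucleotides, "nt,", nucleotides // 3, "codons, remainder",
              nucleotides % 3)
```

Each of the four totals is a multiple of three, which is what would be expected if the listed spans describe complete reading frames.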

Also among these are mutations that may have multiple effects.
Since the Rep coding region is quite complex, some of the mutations
have several effects. Amino acids 542, 598, 600 and 601, which are in
the Rep 68 and 40 intron region, are also in the coding region of
Rep 78 and 52. Codon 630 is in the coding region of Rep 68 and 40 and
in the non-coding region of Rep 78 and 52.

Mutations at 10, 86, 101, 334 and 519 have been previously
identified, and mutations at loci 64, 74, 88, 175, 237, 250 and 429, but
with different amino acid substitutions, have been previously reported. In
all instances, however, the known mutations reportedly decrease the
activity of Rep proteins. Among the mutations described herein are
mutations that result in increases in the activity of the Rep function, as
assessed by detecting increased AAV production.

In particular, as described in the Example, mutations in the Rep- 
encoding region of AAV, including serotypes AAV-1, AAV-2, AAV-3, 
AAV-3B, AAV-4, AAV-5 and AAV-6 are provided (see Example below). 

The mutant proteins and mutant adeno-associated virus (AAV) Rep
proteins are provided. Exemplary proteins with mutations at one or more
of residues 4, 20, 22, 29, 32, 38, 39, 54, 59, 124, 125, 127, 132, 140,
161, 163, 193, 196, 197, 221, 228, 231, 234, 258, 260, 263, 264,
334, 335, 337, 342, 347, 350, 354, 363, 364, 367, 370, 376, 381,
389, 407, 411, 414, 420, 421, 422, 424, 428, 438, 440, 451, 460,
462, 484, 488, 495, 497, 498, 499, 503, 511, 512, 516, 517, 518,
542, 548, 598, 600 and 601 of AAV-2 or the corresponding residues in
other serotypes are provided. Residue 1 corresponds to residue 1 of the
Rep78 protein encoded by nucleotides 321-323 of the AAV-2 genome
(see Figure 3 and the Table below for an alignment of the mutations from
various serotypes).
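
Given that residue 1 is encoded by nucleotides 321-323, the codon for any other residue in the unspliced Rep78 reading frame follows by simple arithmetic, sketched below. The function name is hypothetical, and the mapping is an assumption that holds only for the contiguous Rep78 frame; for Rep 68 and Rep 40 the coordinates differ downstream of the splice.

```python
REP78_FIRST_NUCLEOTIDE = 321  # first nucleotide of codon 1, as stated in the text


def codon_coordinates(residue):
    """Return (first, last) nucleotide positions of the codon for a Rep78 residue."""
    if residue < 1:
        raise ValueError("residue numbering starts at 1")
    first = REP78_FIRST_NUCLEOTIDE + 3 * (residue - 1)
    return first, first + 2


if __name__ == "__main__":
    print(codon_coordinates(1))    # (321, 323)
    print(codon_coordinates(350))  # (1368, 1370)
```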

Of particular interest are mutations that increase activity of the Rep 
proteins compared to wildtype. Such mutations include one or more of 
residues 350, 462, 497, 517, 542, 548, 598, 600 and 630 of AAV-2
and the corresponding residues in other serotypes. Also provided are 
mutations at or near those residues, such as within about 1 to about 10 
residues of these residues such that the resulting protein has increased 
activity. Mutations include insertions, deletions and replacements. 

Lead identification.

Based on the results obtained from the assays described herein (i.e. 
titer of virus produced by each rep variant), each individual rep variant 
was assigned a specific activity. Those variant proteins displaying the 
highest titers were selected as leads and are used to produce rAAV. 

In further steps, rAAV and Rep proteins that contain a plurality of
mutations based on the hits (see Table in the EXAMPLE, listing the hits
and lead sites) are produced to obtain rAAV and Rep proteins that have
activity that is further optimized. Examples of such proteins and AAV
containing such proteins are described in the EXAMPLE. Other
combinations of mutations can be prepared and tested as described herein
to identify other leads of interest, particularly those that have increased
Rep protein activity or that result in higher viral titers in cells containing
such viruses that include appropriate cis acting elements for viral
production.

The rAAV rep mutants are used as expression vectors, which, for
example, can be used transiently for the production of recombinant AAV
stocks. Alternatively, the recombinant plasmids may be used to generate
stable packaging cell lines.

Also among the uses of rAAV, particularly the high titer stocks
produced herein, is gene therapy for the purpose of transferring genetic
information into appropriate host cells for the management and correction
of human diseases including inherited and acquired disorders such as
cancer and AIDS. The rAAV can be administered to a patient at
therapeutically effective doses.

C. Uses of the mutant Rep genes and the rAAV

Gene therapy 

The rAAV provided herein are intended for use as vectors for gene
therapy. The rAAV provided herein are intended for use in any gene
therapy protocol that uses AAV as a vector. The mutant Rep proteins and
nucleic acid molecules can be used to replace the corresponding gene in
other AAV vectors. Of interest are the mutations provided herein that
increase rAAV production. In particular, the mutant Rep proteins are used
to increase production of rAAV derived from any of the AAV serotypes,
including AAV-1, AAV-2, AAV-3, AAV-3B, AAV-4, AAV-5 and AAV-6
serotypes.

Toxicity and therapeutic efficacy of the rAAV can be determined by
standard pharmaceutical procedures in cell cultures or experimental
animals, e.g., for determining the LD50 (the dose lethal to 50% of the
population) and the ED50 (the dose therapeutically effective in 50% of
the population). The dose ratio between toxic and therapeutic effects is
the therapeutic index and it can be expressed as the ratio LD50/ED50.
Doses that exhibit large therapeutic indices are preferred. Doses that
exhibit toxic side effects may be used; care should be taken to design a
delivery system that targets rAAV to the site of treatment in order to
minimize damage to untreated cells and reduce side effects.
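
The therapeutic index defined above is a simple ratio, shown here as a minimal sketch. The function name and the numerical values are invented for illustration and do not represent measured doses.

```python
def therapeutic_index(ld50, ed50):
    """Ratio LD50/ED50; both doses must be expressed in the same units."""
    if ed50 <= 0:
        raise ValueError("ED50 must be positive")
    return ld50 / ed50


if __name__ == "__main__":
    # invented doses in arbitrary but identical units, illustration only
    print(therapeutic_index(ld50=100.0, ed50=5.0))  # 20.0
```

A larger value indicates a wider margin between the effective dose and the lethal dose, which is why large indices are preferred.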

The data obtained from cell culture assays and animal studies can
be used in formulating a range of dosage for use in humans. The dosage
of such rAAV lies preferably within a range of circulating concentrations
that include the ED50 with little or no toxicity. The dosage may vary
within this range depending upon the dosage form employed and the
route of administration utilized. A therapeutically effective dose can be
estimated initially from cell culture assays. A dose may be formulated in
animal models to achieve a circulating plasma concentration range that
includes the IC50 (i.e., the concentration of the test compound which
achieves a half-maximal infection or a half-maximal inhibition) as
determined in cell culture. Such information can be used to more
accurately determine useful doses in humans. Levels in plasma may be
measured, for example, by high performance liquid chromatography.

Treatment of Cancer, HIV, and papilloma and herpes virus
infections and diseases mediated thereby

AAV, which is a helper-dependent parvovirus, requires co-infection
with an adenovirus, herpes virus or papilloma virus (PV) for replication
and particle formation. AAV inhibits PV-induced oncogenic
transformation, and this inhibition has been mapped to the Rep78 protein.
The Rep78 protein inhibits expression of the PV promoter just upstream
of the E6 gene (p89 of bovine PV-1 (BPV-1), p97 of human PV-16
(HPV-16), and p105 of human PV-18 (HPV-18)). DNA binding is required for
this inhibition. Rep78 also binds to the TAR sequences (nt +23 to +42)
and to a region just upstream of the TATA box (nt -54 to -34) in the HIV
LTR region. AAV Rep78 also regulates a variety of other cancer
associated genes, including, but not limited to, c-H-ras (Khleif et al.
(1991) Virology 181:738-741), c-fos and c-myc (Hermonat (1994)
Cancer Lett. 81:129-136).

Infection by AAV is negatively associated with cervical cancer.
Infection and DNA integration by certain PV types are central events in
the etiology of cervical cancer (Durst et al. (1983) Proc. Natl. Acad. Sci.
U.S.A. 80:3812-3815; Cullen et al. (1991) J. Virol. 65:606-612).
Roughly two thirds of cervical cancers contain the HPV-16 virus. AAV is
also commonly found in the anogenital region (Han et al. (1996) Virus
Genes 12:47-52).

Contemplated herein are AAV rep mutants that bind with greater
affinity than wild-type AAV Rep78 to nucleic acid from PV, AAV,
oncogenes or HIV, particularly HIV-1, and particularly promoter and other
transcriptional/translational regulatory sequences from these sources.
The mutant Rep protein when administered to a subject can inhibit PV
and PV-associated diseases, HIV and HIV-associated diseases. Hence
methods for treatment of PV and HIV-mediated disorders by
administration of rAAV encoding the mutant Rep78 genes are provided.
The particular mutants for use in these methods can be identified by
testing each mutant for inhibitory activity, for example, in cell-based
assays. For example, the Rep mutant protein can be tested by contacting
it with nucleic acid from a PV, AAV or HIV or oncogene for a time
sufficient to permit binding thereto, and comparing such binding to the
binding of a wild-type Rep protein under the same conditions.
Alternatively, competitive binding assays may be performed. Mutant
proteins having higher binding affinities are identified.

Fusion proteins containing a tat protein of HIV or other targeting
agent and mutant Rep protein are also provided. Pharmaceutical
compositions containing such fusion proteins are provided. The fusion
proteins can contain additional components, such as E. coli maltose
binding protein (MBP), that aid in uptake of the protein by cells (see,
International PCT application No. WO 01/32711). Nucleic acid
molecules encoding the mutant Rep protein or fusion protein operably
linked to a promoter, such as an inducible promoter for expression in
mammalian cells, are also provided. Such promoters include, but are not
limited to, CMV and SV40 promoters; adenovirus promoters, such as the
E2 gene promoter, which is responsive to the HPV E7 oncoprotein; a PV
promoter, such as the BPV p89 promoter that is responsive to the PV E2
protein; and other promoters that are activated by the HIV or PV or
oncogenes.

The mutant rep proteins are also delivered to the cells in rAAV or a 
portion thereof that can additionally encode therapeutic agents for 
treatment of the cancer or HIV infection or other disorders. 

Methods of inhibiting oncogenic transformation by bovine PV (BPV)
and by human PV (HPV) are provided.

Methods of inhibiting PV, PV-associated diseases, HIV and
HIV-associated diseases are provided. These methods are practiced by
administering the proteins, nucleic acids or rAAV or portions thereof to a
subject, such as a mammal, including a human, to thereby inhibit or
modulate disease progression or oncogenic transformation.

Other systems 

It has been shown that the Rep protein is involved in the regulation
of gene expression, including viral replication as described above, cellular
pathways and protein phosphorylation (see, e.g., Chiorini et al. (1998)
Mol. Cell Biol. 73:5921-5929). Hence the mutant Rep proteins provided
herein can be used to block, stimulate, inhibit, regulate or otherwise
modulate metabolic or cellular signaling pathways. Rep proteins provided
herein can be used to block, stimulate, inhibit, regulate or otherwise
modulate cyclic AMP response pathways, and also to regulate or
modulate cellular promoters as a means of modulating gene expression.
Methods using these proteins for such purposes are provided herein.

Formulation of rAAV

Pharmaceutical compositions containing the rAAV, fusion proteins
or encoding nucleic acid molecules can be formulated in any conventional
manner by mixing a selected amount of rAAV with one or more
physiologically acceptable carriers or excipients. For example, the rAAV
may be suspended in a carrier such as PBS (phosphate buffered saline).
The active compounds can be administered by any appropriate route, for
example, orally, parenterally, intravenously, intradermally,
subcutaneously, or topically, in liquid, semi-liquid or solid form and are
formulated in a manner suitable for each route of administration.
Preferred modes of administration include oral and parenteral modes of
administration.

The rAAV and physiologically acceptable salts and solvates may be
formulated for administration by inhalation or insufflation (either through
the mouth or the nose) or for oral, buccal, parenteral or rectal
administration. For administration by inhalation, the rAAV can be
delivered in the form of an aerosol spray presentation from pressurized
packs or a nebulizer, with the use of a suitable propellant, e.g.
dichlorodifluoromethane, trichlorofluoromethane,
dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the case of a
pressurized aerosol the dosage unit may be determined by providing a
valve to deliver a metered amount. Capsules and cartridges, e.g. of
gelatin, for use in an inhaler or insufflator may be formulated containing a
powder mix of a therapeutic compound and a suitable powder base such
as lactose or starch.

For oral administration, the pharmaceutical compositions may take
the form of, for example, tablets or capsules prepared by conventional
means with pharmaceutically acceptable excipients such as binding
agents (e.g., pregelatinized maize starch, polyvinylpyrrolidone or
hydroxypropyl methylcellulose); fillers (e.g., lactose, microcrystalline
cellulose or calcium hydrogen phosphate); lubricants (e.g. magnesium
stearate, talc or silica); disintegrants (e.g. potato starch or sodium starch
glycolate); or wetting agents (e.g. sodium lauryl sulphate). The tablets
may be coated by methods well known in the art. Liquid preparations for
oral administration may take the form of, for example, solutions, syrups
or suspensions, or they may be presented as a dry product for
constitution with water or other suitable vehicle before use. Such liquid
preparations may be prepared by conventional means with
pharmaceutically acceptable additives such as suspending agents (e.g.
sorbitol syrup, cellulose derivatives or hydrogenated edible fats);
emulsifying agents (e.g. lecithin or acacia); non-aqueous vehicles (e.g.
almond oil, oily esters, ethyl alcohol or fractionated vegetable oils); and
preservatives (e.g. methyl or propyl-p-hydroxybenzoates or sorbic acid).
The preparations may also contain buffer salts, flavoring, coloring and
sweetening agents as appropriate.

Preparations for oral administration may be suitably formulated to
give controlled release of the active compound. For buccal administration
the compositions may take the form of tablets or lozenges formulated in
conventional manner.

The rAAV may be formulated for parenteral administration by
injection, e.g. by bolus injection or continuous infusion. Formulations for
injection may be presented in unit dosage form, e.g. in ampoules or in
multi-dose containers, with an added preservative. The compositions may
take such forms as suspensions, solutions or emulsions in oily or aqueous
vehicles, and may contain formulatory agents such as suspending,
stabilizing and/or dispersing agents. Alternatively, the active ingredient
may be in powder lyophilized form for constitution with a suitable vehicle,
e.g., sterile pyrogen-free water, before use.

In addition to the formulations described previously, the rAAV may
also be formulated as a depot preparation. Such long acting formulations
may be administered by implantation (for example, subcutaneously or
intramuscularly) or by intramuscular injection. Thus, for example, the
therapeutic compounds may be formulated with suitable polymeric or
hydrophobic materials (for example as an emulsion in an acceptable oil) or
ion exchange resins, or as sparingly soluble derivatives, for example, as a
sparingly soluble salt.

The active agents may be formulated for local or topical
application, such as for topical application to the skin and mucous
membranes, such as in the eye, in the form of gels, creams, and lotions
and for application to the eye or for intracisternal or intraspinal
application. Such solutions, particularly those intended for ophthalmic
use, may be formulated as 0.01% - 10% isotonic solutions, pH about
5-7, with appropriate salts. The compounds may be formulated as
aerosols for topical application, such as by inhalation (see, e.g., U.S.
Patent Nos. 4,044,126, 4,414,209, and 4,364,923, which describe
aerosols for delivery of a steroid useful for treatment of inflammatory
diseases, particularly asthma).

The concentration of active compound in the drug composition will
depend on absorption, inactivation and excretion rates of the active
compound, the dosage schedule, and amount administered as well as
other factors known to those of skill in the art. For example, the amount
that is delivered is sufficient to treat the symptoms of hypertension.

The compositions may, if desired, be presented in a pack or
dispenser device which may contain one or more unit dosage forms
containing the active ingredient. The pack may, for example, comprise
metal or plastic foil, such as a blister pack. The pack or dispenser device
may be accompanied by instructions for administration.

The active agents may be packaged as articles of manufacture 
containing packaging material, an agent provided herein, and a label that 
indicates the disorder for which the agent is provided. 
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The following example is included for illustrative purposes only and 
is not intended to limit the scope of the invention. The specific methods 
exemplified can be practiced with other species. The examples are 
intended to exemplify generic processes. 
EXAMPLE
Materials and Methods 

Cells: 

293 human embryo kidney (HEK) cells, obtained from ATCC, were
cultured in Dulbecco's modified Eagle's medium containing 4.5 g/l
glucose (DMEM; GIBCO-BRL) and 10% fetal bovine serum (FBS, Hyclone).
HeLa rep-cap 32 cells, described above, were obtained from Anna Salvetti
(CHU, Nantes) and cultured in the medium described above.

Plasmids: 

pNB-Adeno, which encodes the entire E2A and E4 regions and VA
RNA I and II genes of Adenovirus type 5, was constructed by ligating into
the polylinker multiple cloning site (MCS) of pBSII KS (+/-) (Stratagene, San
Diego, USA) the SalI-HindIII fragment (9842-11555 nt) of Adenovirus
type 5 and the BamHI-ClaI fragment (21563-35950) of pBR325. All
fragments of adenovirus gene were obtained from the plasmid pBHG-10
(Microbix, Ontario, Canada). pNB-AAV encodes the genes rep and cap of
AAV-2 and was constructed by ligation of the XbaI-XbaI PCR fragment
containing the genome of AAV-2 from nucleotide 200 to 4480 into the XbaI
site of the polylinker MCS of pBSII KS (+/-). The PCR fragment was obtained
from pAV1 (ATCC, USA). Plasmid pNB-AAV was derived from plasmid
pAV1, which contains the AAV genomic region, rep and cap. pNB-AAV
does not contain the AAV ITRs present in pAV1. pAAV-CMV(nls)LacZ
was provided by Dr Anna Salvetti (CHU, Nantes).

Plasmid pCMV(nls)LacZ (rAAV vector plasmid) and pNB-Adeno
were prepared in DH5α E. coli and purified by Nucleobond AX PC500 Kit
(Macherey-Nagel), according to standard procedures. Plasmid
pAAV-CMV(nls)LacZ is derived from plasmid psub201 by deleting the rep-cap
region with SnaB I and replacing it with an expression cassette harboring
the cytomegalovirus (CMV) immediate early promoter (407 bp), the
nuclear localized β-galactosidase gene and the bovine growth hormone
polyA signal (324 bp) (see Chadeuf et al. (2000) J. Gene Med.
2:260-268). pAAV-CMV(nls)LacZ was provided by Dr Anna Salvetti.

Virus:

Wild type adenovirus (AV) type 5 stock, originally provided by Dr
Philippe Moullier (CHU, Nantes), was produced according to standard
procedures.

Construction of Rep mutant libraries 

25 pmol of each mutagenic primer was placed into a 96-well PCR
plate. 15 µl of reaction mix (0.25 pmol of pNB-AAV), 25 pmol of the
selection primer (changing one non-essential unique restriction site to a
new restriction site), 2 µl of 10X mutagenesis buffer (100 mM Tris-acetate
pH 7.5, 100 mM MgOAc and 500 mM KOAc pH 7.5) was added into each
well. The samples were incubated at 98 °C for 5 minutes and then
immediately incubated for 5 minutes on ice. Finally, the plate was placed
at room temperature for 30 minutes.

The primer extension and ligation reactions of the new strands
were completed by adding to each sample: 7 µl of nucleotide mix (2.86
mM each nucleotide and 1.43X mutagenesis buffer) and 3 µl of a fresh
1:10 enzyme dilution mix (0.025 U/µl of native T7 DNA polymerase and
1 U/µl of T4 DNA ligase were diluted in 20 mM Tris-HCl pH 7.5, 10 mM
KCl, 10 mM mercaptoethanol, 1 mM DTT, 0.1 mM EDTA and 50%
glycerol). Samples were incubated at 37 °C for 1 hour. The T4 DNA ligase
was inactivated by incubating the reactions at 72 °C for 15 minutes to
prevent re-ligation of the digested strands during the digestion of the
parental plasmid (pNB-AAV).

Each mutagenesis reaction was digested with restriction enzyme to
eliminate parental plasmids: 30 µl solution containing 3 µl of 10X enzyme
digestion buffer and 10 units of restriction enzyme were added to each
mutagenesis reaction and incubated at 37 °C for at least 3 hours.

90 µl of the E. coli XLmutS competent cells (Stratagene, San Diego
CA; supplemented with 1.5 µl of β-mercaptoethanol to a final
concentration of 25 mM) were aliquoted into prechilled deep-well plates.
The plates were incubated on ice for 10 minutes, with gentle swirling every
2 minutes.
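As a quick consistency check on the volumes quoted above, the stock concentration implied by a given spike volume follows from C1·V1 = C2·V2. The sketch below assumes simple additive volumes; the function name is illustrative and is not part of the protocol.

```python
def implied_stock_mM(final_mM, spike_ul, cells_ul):
    """Stock concentration (mM) implied by C1*V1 = C2*V2, assuming additive volumes."""
    total_ul = spike_ul + cells_ul
    return final_mM * total_ul / spike_ul


# 1.5 ul of beta-mercaptoethanol added to 90 ul of cells, 25 mM final
print(round(implied_stock_mM(25.0, 1.5, 90.0)))  # 1525 mM, i.e. roughly 1.5 M
```

The same function can be applied to any other spike volume and cell volume in the protocol to confirm that a single stock solution is being assumed throughout.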

A fraction of the reactions that had been digested with restriction
enzyme (1/10 of the total volume) was added to the deep well plates. The
plates were swirled gently prior to incubation on ice for 30 minutes. A
heat pulse was performed in a 42 °C water bath for 45 seconds, the
transformation mixture was incubated on ice for 2 minutes and 0.45 ml of
preheated SOC medium (2% (w/v) tryptone, 0.5% (w/v) yeast extract,
8.5 mM NaCl, 2.5 mM KCl, 10 mM MgCl2 and 20 mM glucose at pH 7)
was added. The plates were incubated at 37 °C for 1 hour with shaking.

To enrich for mutant plasmids, 1 ml of 2X YT broth medium (YT
medium is 0.5% yeast extract, 0.5% NaCl, 0.8% bacto-tryptone),
supplemented with 100 µg/ml of ampicillin, was added to each
transformation mixture and the cultures were grown overnight at 37 °C
with shaking. Plasmid DNA isolation was performed from each mutant
culture using the standard procedure described in the Nucleospin Multi-96 Plus
Plasmid Kit (Macherey-Nagel). Five hundred fjq of the resulting isolated 
DNA was digested with 10 units of the selection restriction enzyme in a
total volume of 30 µl containing 3 µl of 10X enzyme digestion buffer
overnight at 37 °C.
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A fraction of the digested reactions (1/10 of the total volume) were 
transformed into 40 µl of Epicurian coli XL1-Blue competent cells
supplemented with 0.68 µl of β-mercaptoethanol to a final concentration
of 25 mM. After heat pulse, 0.45 ml of SOC was added and the
transformation mixtures were incubated for 1 hour at 37 °C with shaking
before being plated on LB-ampicillin agar plates. The agar plates were
incubated overnight at 37 °C and the colonies obtained were picked
and grown overnight at 37 °C in deep-well plates.

Four clones per reaction were screened for the presence of the
mutation using the restriction enzyme specific to the new restriction site
introduced into the mutated plasmid with the selection primer. The cDNA
from selected clones was also sequenced to confirm the presence of the
expected mutation.

Monitoring rAAV Production

rAAV from each of the above wells were produced by triple
transfection on 293 HEK cells. 3 x 10^4 cells were seeded into each well
of a 96-well microplate and cultured for 24 hours before transfection.
Transfection was made on cells at about 70% confluency. 25 kDa PEI
(poly-ethylene-imine, Sigma-Aldrich) was used for the triple transfection
step. Equimolar amounts of the three plasmids, AV helper plasmid
(pNB-Adeno), AAV helper plasmid (pNB-AAV or a mutant clone rep plasmid)
and vector plasmid (pAAV-CMV(nls)LacZ), were mixed with 10 mM PEI by
gently shaking. The mixture was then added to the culture medium on the
cells. 60 hours after transfection, the culture medium was replaced with
100 µl of lysis buffer (50 mM Hepes, pH 7.4; 150 mM NaCl; 1 mM MgCl2;
1 mM CaCl2; 0.01% CHAPS). After one cycle of freeze-thawing the
cellular lysate was filtered through a Millipore 96-well filter plate and
stored at -80 °C.
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rAAV infection particles (ip) 

Titers of rAAV vector particles were determined on HeLa rep/cap
32 cells using the standard dRA (serial dilution replication assay) test. Cells
were plated 24 hours before infection at a density of 1 x 10^4 cells in
96-well plates. Serial dilutions of the rAAV preparation were made between 1
and 1 x 10 6 //l and used for co-infection of the HeLa rep/cap 32 cells 
together with wt-AV type 5 (MOI 25). 48 hours after infection the ip 
were measured by real time PCR or by the quantification of biological 
activity of the transgene. 

Real Time PCR

Infected HeLa rep/cap 32 cells were lysed with 50 µl of solution
(50 mM Hepes, pH 7.4; 150 mM NaCl). After one cycle of freeze-thawing,
50 µl of Proteinase K (10 mg/ml) and the lysate were incubated for one hour
at 55 °C. The enzyme was inactivated by incubation for 10 min at 96 °C.

For real time PCR, 0.2 µl of lysate was taken. Final volume of the
reaction was 10 µl in a 384-well plate using an Applied Biosystems Prism
7900. The primers and fluorescence probe set corresponding to the CMV
promoter were as follows: CMV 1 primer
5'-TGCCAAGTACGCCCCCTAT-3' (SEQ ID No. 733) (0.2 µM) and CMV 2
primer 5'-AGGTCATGTACTGGGCATAATGC-3' (SEQ ID No. 734) (0.2
µM); probe VIC-Tamra 5'-TCAATGACGGTAAATGGCCCGCCT-3' (SEQ
ID No. 735) (0.1 µM). dRA plots were obtained by plotting the DNA copy
number (obtained by real time PCR) vs. the dilution of the rAAV
preparation.
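For readers transcribing the oligonucleotides, a small sanity check can catch copy errors. The sketch below only reports length, GC fraction and reverse complement for the three sequences quoted in the text; the helper names are illustrative and are not part of the described method.

```python
# Sequences as quoted in the text (5' to 3').
OLIGOS = {
    "CMV 1": "TGCCAAGTACGCCCCCTAT",
    "CMV 2": "AGGTCATGTACTGGGCATAATGC",
    "probe": "TCAATGACGGTAAATGGCCCGCCT",
}


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq):
    """Reverse complement of a DNA sequence containing only A, C, G, T."""
    pairs = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(pairs[base] for base in reversed(seq.upper()))


for name, seq in OLIGOS.items():
    print(name, len(seq), round(gc_fraction(seq), 2), reverse_complement(seq))
```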

β-Galactosidase activity

After 48 hours of infection, cells were treated with trypsin, and
100 µl of reaction solution (GalScreen Kit, Tropix) was added and
incubated for one hour at 26 °C. Luminescence was measured in a
NorthStar (Tropix) HTS station. dRA plots were obtained by plotting the
intensity of β-galactosidase activity vs. the dilution of the rAAV
preparation.

Mathematical Model for results analysis: 

Results were analyzed using the Hill equation-based analysis
(designated NautScan™; see Patent n° 9915884, 1999, France;
published as International PCT application No. WO 01/44809 (PCT n°
PCT/FR00/03503, Dec. 2000)). Briefly, data were processed using a Hill
equation-based model that allows extraction of key feature indicators of
performance for each individual mutant. Mutants were ranked based on
the values of their individual performance and those at the top of the
ranking list were selected as Leads.
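The text does not give the NautScan model itself, so the following is only a minimal sketch of what a Hill equation-based fit to a dilution series could look like. It assumes the common three-parameter form y = y_max·x^n / (k^n + x^n) and a coarse grid search; the parameter names, grids and synthetic data are all illustrative assumptions.

```python
def hill(x, y_max, k, n):
    """Hill response: y_max * x**n / (k**n + x**n)."""
    xn = x ** n
    return y_max * xn / (k ** n + xn)


def fit_hill(xs, ys, k_grid, n_grid):
    """Coarse least-squares fit: grid over (k, n), closed-form y_max for each pair."""
    best = None
    for k in k_grid:
        for n in n_grid:
            shape = [hill(x, 1.0, k, n) for x in xs]
            denom = sum(s * s for s in shape)
            if denom == 0:
                continue
            y_max = sum(s * y for s, y in zip(shape, ys)) / denom
            sse = sum((y - y_max * s) ** 2 for s, y in zip(shape, ys))
            if best is None or sse < best[0]:
                best = (sse, y_max, k, n)
    _, y_max, k, n = best
    return y_max, k, n


# Synthetic, noise-free example data (not measurements from the document).
dilutions = [10.0 ** -i for i in range(7)]
signal = [hill(x, 1000.0, 0.01, 1.0) for x in dilutions]
k_grid = [10 ** (-e / 2) for e in range(13)]
n_grid = [0.5, 1.0, 1.5, 2.0]
print(fit_hill(dilutions, signal, k_grid, n_grid))
```

The fitted parameters (plateau, half-maximal dilution, steepness) are the kind of per-mutant indicators on which a ranking could be based; which indicators the cited method actually uses is not stated here.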
Results 

Generation of diversity. 

To identify candidate amino acid (aa) positions on the rep protein
involved in rep protein activity, an Ala-scan was performed on the rep
sequence. For this, each amino acid in the rep protein sequence was
replaced with Alanine. To do this, sets of rAAV that encode mutant rep
proteins, each of which differs from wild type by replacement of one
amino acid with Ala, were generated. Each set of rAAV was individually
introduced into cells in a well of a microtiter plate, under conditions for
expression of the rep protein. The amount of virus that could be
produced from each variant was measured as described below. Briefly,
activity of Rep was assessed by determining the amount of AAV or rAAV
produced using infection assays on HeLa Rep-cap 32 cells and by
measurement of AAV DNA replication using Real Time PCR, or by
assessing transgene (β-galactosidase) expression. The relative activity of
each individual mutant compared to the native protein was assessed and
"hits" identified. Hit positions are the positions in the mutant proteins
that resulted in an alteration (selected to be at least about 20%), in this
instance all resulted in a decrease, in the amount of virus produced
compared to the activity of the native (wildtype) gene (see Fig. 2A).
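The scan and hit-calling logic described above can be sketched as follows. This is an illustration under stated assumptions: positions that are already alanine are skipped, titers are hypothetical numbers, and the 20% threshold is applied as a fractional change relative to wild type.

```python
def ala_scan(protein):
    """List (position, wild_type, 'A') for every non-alanine residue, 1-based."""
    return [(i, aa, "A") for i, aa in enumerate(protein, start=1) if aa != "A"]


def call_hits(titers, wild_type_titer, min_change=0.20):
    """Return (position, fractional change) where |change| is at least min_change."""
    hits = []
    for position, titer in sorted(titers.items()):
        change = (titer - wild_type_titer) / wild_type_titer
        if abs(change) >= min_change:
            hits.append((position, change))
    return hits


# Hypothetical titers for two single-alanine mutants; wild type set to 100.
print(ala_scan("MAF"))
print(call_hits({4: 50.0, 5: 95.0}, 100.0))
```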

The hits were then used for identification of leads (see Fig. 2B).
Assays for Rep activity were performed as described for identification of
the hit positions. Hit positions on Rep proteins and the effect of specific
amino acids on the productivity of AAV-2 are summarized in the following
table:



Hit position        replacing amino acid (effect)

4 (ttt) F           (gct) A (decrease)
10 (aag) K          (gcg) A (decrease)
20 (ccc) P          (gcc) A (decrease)
22 (att) I          (gct) A (decrease)
28 (tgg) W          (gcg) A (decrease)
32 (gag) E          (gcg) A (decrease)
38 (ccg) P          (gcg) A (decrease)
39 (cca) P          (gca) A (decrease)
54 (ctg) L          (gct) A (decrease)
59 (ctg) L          (gcg) A (decrease)
64 (ctg) L          (gcg) A (decrease)
74 (ccg) P          (gcg) A (decrease)
86 (gag) E          (gcg) A (decrease)
88 (tac) Y          (gcc) A (decrease)
101 (aaa) K         (gca) A (decrease)
124 (atc) I         (gcc) A (decrease)
125 (gag) E         (gcg) A (decrease)
127 (act) T         (gct) A (decrease)
132 (ttc) F         (gcc) A (decrease)
140 (ggc) G         (gcc) A (decrease)





WO 03/018820 



PCT/IB02/04087 



-50- 



Hi1* nn^ition 

1 III |JUO 1 LIUI 1 


replacing amino acid (effect) 


1 61 (acc) T 


\yCC/ M lUcCfcabc/ 




163 (cct) P 


(get; M luccrease/ 




175 (tat) Y 

I/O \ lu 1/ I 


igci; ^ \oecrease; 




1 frtol 1 
1 C/ o \i> iy / 1_ 


(gcg; a (decrease; 




1 Qfi Into) \/ 
1 yg (gig; v 


i9 c g; a (decrease; 






\gcc) a (decrease; 




991 ftr;** <^ 


(gca; a (decrease; 




99fi fntnl V 


(gcg; a (decrease; 




ZO I IClUj L 


(gee; A (decrease} 






(gcg; A (decrease; 




Zo/ \oCC/ I 


(gec; A (decrease; 




9 en /tar>\ V 

zou \iac; T 


(gee; A (decrease; 




ZOO \adL/ IN 


(gec; A (decrease; 




zou \cgg; n 


(gcg; A (decrease; 




ZOO Idlu/ 1 


(gec; A (decrease; 




ZD'f \aay/ IS. 


(gcg; a (uecrease; 




oo*+ vyyy; 


(gcg) A (decrease) 




335 (cct) V         (gct) A (decrease)
337 (act) T         (gct) A (decrease)
341 (acc) T         (gcc) A (decrease)
342 (aac) N         (gcc) A (decrease)
347 (ata) I         (gca) A (decrease)
350 (act) T         (gct) A (decrease); (aat) N (increase)
354 (tac) Y         (gcc) A (decrease)
363 (aac) N         (gcc) A (decrease)
364 (ttt) F         (gct) A (decrease)
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i mi puoiiiun 


j replacing amino acid (effect) || 




II *3fi7 taar*\ M 
II oO / \33C) IN 


1/ \ A i t \ 

1 (gec) A (decrease) 






o/u igic) V 


I (gec) A (decrease) 


1 




|| o/o (iggj vv 


If 1 \ A i ■ \ 

1 (gcg) A (decrease) 


1 

j 




| oo I (aag) ix 


| (gcg) A (decrease) 






I ooz. (atg) ivi 


1 (gcg) A (decrease) 


1 

1 




I qoQ c 

I ooy (teg) o 


1 (gcg) A (decrease) 


1 




4U/ (tec) o 


J (gec) A (decrease) 






i 4 I i lata) I 


/ % A M I m 

J (gca) A (decrease) 







41 *v (act; I 


| (get) A (decrease) 





1 n 


4zU (tCC) O 


If \ A i % \ 

1 (get) A (decrease) 







4Z I (aac) IM 


1 / \ A. t 1 % 

\ (gec) A (decrease) 







(acc) l 


1 (gec) A (decrease) 







A OA I --v+o. \ Ail 

I (atg) M 


1 (gcg) A (decrease) 







| 4zo (att) 1 


(get) A (decrease) 







| 4z9 (gac) D I 


(gec) A (decrease) 






4oo (cag) Q j 


(gcg) A (decrease) j 






h-h-vj iccg; r | 


(gcg) A (decrease) | 






451 (acc) T         (gcc) A (decrease)
460 (aag) K         (gcg) A (decrease)
462 (acc) T         (gcc) A (decrease); (ata) I (increase)
484 (ttc) F         (gcc) A (decrease)
488 (aag) K         (gcg) A (decrease)
495 (ccc) P         (gcc) A (decrease)
497 (ccc) P         (gcc) A (decrease); (cga) R (increase)
497 (ccc) P         (gcc) A (decrease); (ctc) L (increase)
497 (ccc) P         (gcc) A (decrease); (tac) Y (increase)
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■ ft * * * * * 

Hit position        replacing amino acid (effect)

498 (agt) S         (gct) A (decrease)
499 (gac) D         (gcc) A (decrease)
503 (agt) S         (gcg) A (decrease)
511 (tca) S         (gca) A (decrease)
512 (gtt) V         (gct) A (decrease)
516 (tcg) S         (gcg) A (decrease)
517 (acg) T         (gct) A (decrease); (aac) N (increase)
518 (tca) S         (gca) A (decrease)
519 (gac) D         (gcg) A (decrease)
542 (ctg) L         (gcg) A (decrease); (tcg) S (increase)
548 (aga) R         (gca) A (decrease); (agc) S (increase)
598 (gga) G         (gca) A (decrease); (agc) S (increase)
600 (gtg) V         (gcg) A (decrease); (ccg) P (increase)
601 (cca) P         (gca) A (decrease)

Hit position        replacing sequence (effect)
(within intron)

630 (tgc)           gcg (decrease); cgc or tca or cct (increase)
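Because each row of the table pairs a codon with a one-letter amino acid, transcribed rows can be cross-checked against the standard genetic code. The sketch below builds the standard codon table and flags rows whose codon does not encode the stated residue; the sample rows are copied from the table, and the function name is illustrative.

```python
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Standard genetic code, codons in lower case as in the table.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def inconsistent_rows(rows):
    """Return (position, codon, stated, encoded) for rows that disagree with the code."""
    flagged = []
    for position, codon, stated in rows:
        encoded = CODON_TABLE[codon.lower()]
        if encoded != stated:
            flagged.append((position, codon, stated, encoded))
    return flagged


sample = [(4, "ttt", "F"), (10, "aag", "K"), (350, "aat", "N"), (497, "cga", "R")]
print(inconsistent_rows(sample))  # an empty list means every sampled row is consistent
```

A row transcribed as position 335, codon cct, residue V would be flagged by this check, since cct encodes proline in the standard code.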



The hits in other AAV serotypes (see also Figures 3A and 3B) are
as follows:



HIT POSITION

AAV-2   AAV-1   AAV-3   AAV-3B  AAV-4   AAV-6   AAV-5
4       4       4       4       4       4       4
10      10      10      10      10      10      10
20      20      20      20      20      20      20
22      22      22      22      22      22      22
29      29      29      29      29      29      29
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HIT POSITION 



32      32      32      32      32      32      32
38      38      38      38      38      38      38
39      39      39      39      39      39      39
54      54      54      54      54      54      54
59      59      59      59      59      59      59
64      64      64      64      64      64      64
74      74      74      74      74      74








86 


86 
oo 


' 86 
oo 


86 
oo 


8R 

OS 




RR 


88 
oo 


88 
oo 


88 
oo 


88 
oo 


87 
o / 


1 01 


1 ni 


101 


1 01 


1 ni 


1 01 

IU 1 


1 on 

1 oo 


1 94 


1 74 


1 24 


1 9d 


1 94. 


1 94. 


1 9*3 
I Z. O 


1 25 


1 25 


1 25 


1 9R 

1 ZO 


1 9R 


1 9R 


1 9 A 


1 97 


1 27 


1 27 


1 97 


1 97 
I z. / 


1 97 


1 9fi 


1 *39 


1 "39 
I o<£ 


1 10 


1 *39 
I Odd 


I O^ 


iqo 


1 O 1 


1 AD 

1 **-U 


1 AH 


1 An 

1 *+U 




1 An 


i ah 

1 H-w 1 




ID 1 


1 fi1 

I O I 


1 O i 


1 R1 
1 D 1 


lei 

1 O 1 


I O I 


I DO 


1 fi^ 




1 DO 


1 OO 


I DO 


l DO 


1 ftn 

I DU 


175 


175 
• * w 


175 


175 


1 75 


1 75 

1 / w 


1 79 


193     193     193     193     193     193     190
196     196     196     196     196     196     193
197     197     197     197     197     197     194
221     221     221     221     221     221     217
228     228     228     228     228     228     224
231     231     231     231     231     231     227
234     234     234     234     234     234     230
237     237     237     237     237     237     233
250     250     250     250     250     250     246
258     258     258     258     258     258     254
260     260     260     260     260     260     256
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HIT POSITION 





96*3 




263 


263 


263 


259 

^m- W W 


964 




964 


264 


264 
\j 


264 


260 


oo*+ 






oo*-r 


334 


334 


330 


OvJO 


000 


000 


OOO 


3*3o" 
00 0 




331 


QQ7 
OO / 


007 
00 / 


OO / 


OO / 


^^7 
OO / 


00 / 


OOO 


QA1 
O^f I 


OA 1 


1AA 
04 I 


0*r 1 


0*r ! 


*341 

0*T I 


337 


OA9 


*3A9 


*3A9 


0*rZ 




*3A9 

0*T^1 


338 

OOO 


QA7 
Of/ 


"3ZL7 
o*f / 


"3/17 
0*+ / 


LA7 

0*T / 


0*r / 


LA7 

0*r / 




OCA 

oOU 


ocn 
OOU 


OOU 


OOU 


OOU 


OOU 


OtU 






QCjI 

00*f 


OCA 


00*+ 


O0*r 


OOU 




OCT 

ODO 


OOO 


OOO 


OOO 


OOO 


OOj 






004 


ODH- 


OOH- 


oo*f 


OOU 


OCT 
OO / 


OCT 

OO / 


OCT 

00 / 


OCT 

OO / 


OO / 


OO / 


OOO 


0 /U 


0 /u 


0 /u 


O /U 


070 
O / U 


Q7n 
0 / u 


OOO 


O /O 


O/O 


O/O 


O/O 


O/O 


O/O 


"379 

O / £~ 


ool 


001 

ool 


00 1 

ool 


OQ1 
OO I 


OO I 


001 
OO I 


0 / / 


382 


000 
OOZ 


OOO 

ooz 


OOO 

ooz 


OOO 

ooz 


QQO 
ooz 


07R 
O/O 




ooy 


9.P.Q 

ooy 


ooy 


IRQ 
OOO 


OOO 


385 

w w w 


407     407     407     407     407     407     403
411     411     411     411     411     411     407
414     414     414     414     414     414     410
420     420     420     420     420     420     416
421     421     421     421     421     421     417
422     422     422     422     422     422     418
424     424     424     424     424     424     420
428     428     428     428     428     428     424
429     429     429     429     429     429     425
438     438     438     438     438     438     434
440     440     440     440     440     440     436






451 


451 


451 


451 


451 


*+ \j i 


AA7 


460 


460 


460 


460 


460 




*rOO 


462 

i ,P T 


462 


462 


46? 








! 484 


484 


484 


4RA 


tO*t 


AQA 


Aon 


1 488 


488 


488 


48ft 
too 


Aft ft 


/tOQ 

T-OO 




495 


495 


495 


4Q*> 


AQK 




AQ1 

^y i 


1 497 


497 


497 


4.Q7 


AQ7 


AQ7 


AQQ 


498 


438 


4-98 


4Qft 




f AOS 


AQA 


1 49Q 


4.QQ 


AQQ 


AQQ 




AQQ 




503     503     503     503     503     503     499
511     511     511     511     511     511     529
512     512     512     512     512     512     530
516     516     516     516     516     516     534
517     517     517     517     517     517     535
518     518     518     518     518     518     536
519     519     519     519     519     519     537
542     543     542     542     542     543     561
548     549     548     548     548     549     567
598     599     600     600     599     599
600     602     603     603     602     602     589
601     603     604     604     603     603     590
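To show how the correspondence table is meant to be read, the sketch below stores three of its rows and looks up the position in another serotype that corresponds to a given AAV-2 hit position. The column order follows the table header; the function name and the choice of rows are illustrative.

```python
SEROTYPES = ("AAV-2", "AAV-1", "AAV-3", "AAV-3B", "AAV-4", "AAV-6", "AAV-5")

# Three rows copied from the table, in SEROTYPES column order.
HIT_ROWS = [
    (542, 543, 542, 542, 542, 543, 561),
    (548, 549, 548, 548, 548, 549, 567),
    (600, 602, 603, 603, 602, 602, 589),
]


def corresponding_position(aav2_position, serotype):
    """Position in the given serotype that corresponds to an AAV-2 hit position."""
    column = SEROTYPES.index(serotype)
    for row in HIT_ROWS:
        if row[0] == aav2_position:
            return row[column]
    raise KeyError(f"AAV-2 position {aav2_position} is not in this excerpt")


print(corresponding_position(542, "AAV-5"))  # 561
```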



Sets of nucleic acids encoding the rep protein were generated. The
rep proteins encoded by these sets of nucleic acid molecules were those
in which each amino acid position identified as a "hit" in the ala-scan
step was sequentially replaced by all remaining 18 amino acids
using site directed mutagenesis. Each mutant was designed, generated,
processed and analyzed physically separated from the others in
addressable arrays. No mixtures, pools, or combinatorial processing
were used.

As in the first round (alanine scan), a library of mutant rAAV was
generated in which each individual mutant was independently and
individually generated in an independent reaction, such that each
mutant contains only a single amino acid change, and this was repeated
for each amino acid residue. Again, each resulting mutant rep protein was then
expressed and the amount of virus produced in cells assessed and
compared to the native protein.
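The second-round library can be enumerated mechanically: for each hit position, the wild-type residue and alanine (already tested in the first round) are excluded, leaving 18 substitutions, each handled as its own addressable entry. The sketch below is illustrative; the two example hits are taken from the table of hit positions, and the function names are assumptions.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def remaining_substitutions(wild_type):
    """The 18 residues left after removing the wild-type residue and alanine."""
    return [aa for aa in AMINO_ACIDS if aa not in (wild_type, "A")]


def build_library(hits):
    """hits maps position -> wild-type residue; returns one (pos, wt, new) per mutant."""
    return [
        (position, wild_type, new)
        for position, wild_type in sorted(hits.items())
        for new in remaining_substitutions(wild_type)
    ]


library = build_library({350: "T", 497: "P"})
print(len(library))  # 36 single mutants, 18 per hit position
```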

Lead identification

Since rep proteins that result in increased virus production are of
interest, those mutants that lead to an increase in the amount of virus
produced (2 to 10 times the native activity) were selected as "leads."
Ten such mutants were identified.

Based on the results obtained from the assays described above (i.e.
titer of virus produced by each rep variant), each individual rep variant
was assigned a specific activity. Those variant proteins displaying the
highest titers were selected as leads (see Table above). Leads include:
amino acid replacement of T by N at Hit position 350; T by I at Hit
position 462; P by R at Hit position 497; P by L at Hit position 497; P by
Y at Hit position 497; T by N at Hit position 517; L by S at Hit position
542; R by S at Hit position 548; G by S at Hit position 598; G by D at Hit
position 598; V by P at Hit position 600.
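Lead selection by fold increase over the native protein can be sketched as below. The titers are hypothetical values chosen only to exercise the code, the variant labels follow the wild-type/position/replacement pattern used in the text, and the 2-fold cutoff reflects the lower bound of the range quoted above.

```python
def select_leads(titers, native_titer, min_fold=2.0):
    """Return (variant, fold) pairs with fold >= min_fold, highest fold first."""
    leads = []
    for variant, titer in titers.items():
        fold = titer / native_titer
        if fold >= min_fold:
            leads.append((variant, fold))
    return sorted(leads, key=lambda item: item[1], reverse=True)


# Hypothetical titers in arbitrary units; native protein set to 100.
example_titers = {"T350N": 300.0, "P497R": 800.0, "F4A": 40.0}
print(select_leads(example_titers, 100.0))
```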

Also provided are combinations of the above mutant Rep 78, 68,
52, 40 proteins, nucleic acids encoding the proteins, and recombinant
AAV (any serotype) containing the mutation at the indicated position or
corresponding position for serotypes other than AAV-2, including any set
forth in the following table and corresponding SEQ ID Nos. Each amino
acid sequence is set forth in a separate sequence ID listing; for each
mutation or combination thereof there is a single SEQ ID setting forth the
unspliced nucleic acid sequence for Rep78/68, which for all mutations
from amino acid 228 on, includes the corresponding Rep 52 and Rep 40
encoding sequence as well.

Amino acid sequences of exemplary mutant Rep proteins

Seq no.  gene  position(s)  codon(s)
seq.1  rep78  4  GCT
seq.2  rep68  4  GCT
seq.3  rep78  10  GCG
seq.4  rep68  10  GCG
seq.5  rep78  20  GCC
seq.6  rep68  20  GCC
seq.7  rep78  22  GCT
seq.8  rep68  22  GCT
seq.9  rep78  29  GCG
seq.10  rep68  29  GCG
seq.11  rep78  38  GCG
seq.12  rep68  38  GCG
seq.13  rep78  39  GCA
seq.14  rep68  39  GCA
seq.15  rep78  53  GCT
seq.16  rep68  53  GCT
seq.17  rep78  59  GCG
seq.18  rep68  59  GCG
seq.19  rep78  64  GCT
seq.20  rep68  64  GCT
seq.21  rep78  74  GCG
seq.22  rep68  74  GCG
seq.23  rep78  86  GCG
seq.24  rep68  86  GCG
seq.25  rep78  88  GCC
seq.26  rep68  88  GCC
seq.27  rep78  101  GCA
seq.28  rep68  101  GCA
seq.29  rep78  124  GCC
seq.30  rep68  124  GCC
seq.31  rep78  125  GCG
seq.32  rep68  125  GCG
seq.33  rep78  127  GCT
seq.34  rep68  127  GCT
seq.35  rep78  132  GCC
seq.36  rep68  132  GCC
seq.37  rep78  140  GCC
seq.38  rep68  140  GCC
seq.39  rep78  161  GCC
seq.40  rep68  161  GCC
seq.41  rep78  163  GCT
seq.42  rep68  163  GCT
seq.43  rep78  175  GCT
seq.44  rep68  175  GCT
seq.45  rep78  193  GCG
seq.46  rep68  193  GCG
seq.47  rep78  196  GCC
seq.48  rep68  196  GCC
seq.49  rep78  197  GCC
seq.50  rep68  197  GCC
seq.51  rep78  221  GCA
seq.52  rep68  221  GCA
seq.53  rep78  228  GCG
seq.54  rep52  228  GCG
seq.55  rep68  228  GCG
seq.56  rep40  228  GCG
seq.57  rep78  231  GCC
seq.58  rep52  231  GCC
seq.59  rep68  231  GCC
seq.60  rep40  231  GCC
seq.61  rep78  234  GCG
seq.62  rep52  234  GCG
seq.63  rep68  234  GCG
seq.64  rep40  234  GCG
seq.65  rep78  237  GCC
seq.66  rep52  237  GCC
seq.67  rep68  237  GCC
seq.68  rep40  237  GCC
seq.69  rep78  250  GCC
seq.70  rep52  250  GCC
seq.71  rep68  250  GCC
seq.72  rep40  250  GCC
seq.73  rep78  258  GCC
seq.74  rep52  258  GCC
seq.75  rep68  258  GCC
seq.76  rep40  258  GCC
seq.77  rep78  260  GCG
seq.78  rep52  260  GCG
seq.79  rep68  260  GCG
seq.80  rep40  260  GCG
seq.81  rep78  263  GCC
seq.82  rep52  263  GCC
seq.83  rep68  263  GCC
seq.84  rep40  263  GCC
seq.85  rep78  264  GCG
seq.86  rep52  264  GCG
seq.87  rep68  264  GCG
seq.88  rep40  264  GCG
seq.89  rep78  334  GCG
seq.90  rep52  334  GCG

seq.91  rep68  334  GCG
seq.92  rep40  334  GCG
seq.93  rep78  335  GCT
seq.94  rep52  335  GCT
seq.95  rep68  335  GCT
seq.96  rep40  335  GCT
seq.97  rep78  337  GCT
seq.98  rep52  337  GCT
seq.99  rep68  337  GCT
seq.100  rep40  337  GCT
seq.101  rep78  341  GCC
seq.102  rep52  341  GCC
seq.103  rep68  341  GCC
seq.104  rep40  341  GCC
seq.105  rep78  342  GCC
seq.106  rep52  342  GCC
seq.107  rep68  342  GCC
seq.108  rep40  342  GCC
seq.109  rep78  347  GCA
seq.110  rep52  347  GCA
seq.111  rep68  347  GCA
seq.112  rep40  347  GCA
seq.113  rep78  350  AAT
seq.114  rep52  350  AAT
seq.115  rep68  350  AAT
seq.116  rep40  350  AAT
seq.117  rep78  350  GCT
seq.118  rep52  350  GCT
seq.119  rep68  350  GCT
seq.120  rep40  350  GCT
seq.121  rep78  354  GCC
seq.122  rep52  354  GCC
seq.123  rep68  354  GCC
seq.124  rep40  354  GCC
seq.125  rep78  363  GCC
seq.126  rep52  363  GCC
seq.127  rep68  363  GCC
seq.128  rep40  363  GCC
seq.129  rep78  364  GCT
seq.130  rep52  364  GCT
seq.131  rep68  364  GCT
seq.132  rep40  364  GCT
seq.133  rep78  367  GCC
seq.134  rep52  367  GCC
seq.135  rep68  367  GCC
seq.136  rep40  367  GCC
seq.137  rep78  370  GCC
seq.138  rep52  370  GCC
seq.139  rep68  370  GCC
seq.140  rep40  370  GCC
seq.141  rep78  376  GCG
seq.142  rep52  376  GCG
seq.143  rep68  376  GCG
seq.144  rep40  376  GCG
seq.145  rep78  381  GCG
seq.146  rep52  381  GCG
seq.147  rep68  381  GCG
seq.148  rep40  381  GCG
seq.149  rep78  382  GCG
seq.150  rep52  382  GCG
seq.151  rep68  382  GCG
seq.152  rep40  382  GCG
seq.153  rep78  389  GCG
seq.154  rep52  389  GCG
seq.155  rep68  389  GCG
seq.156  rep40  389  GCG
seq.157  rep78  407  GCC
seq.158  rep52  407  GCC
seq.159  rep68  407  GCC
seq.160  rep40  407  GCC
seq.161  rep78  411  GCA
seq.162  rep52  411  GCA
seq.163  rep68  411  GCA
seq.164  rep40  411  GCA
seq.165  rep78  414  GCT
seq.166  rep52  414  GCT
seq.167  rep68  414  GCT
seq.168  rep40  414  GCT
seq.169  rep78  420  GCT
seq.170  rep52  420  GCT
seq.171  rep68  420  GCT
seq.172  rep40  420  GCT
seq.173  rep78  421  GCC
seq.174  rep52  421  GCC
seq.175  rep68  421  GCC
seq.176  rep40  421  GCC
seq.177  rep78  422  GCC
seq.178  rep52  422  GCC
seq.179  rep68  422  GCC
seq.180  rep40  422  GCC
seq.181  rep78  424  GCG
seq.182  rep52  424  GCG
seq.183  rep68  424  GCG
seq.184  rep40  424  GCG
seq.185  rep78  428  GCT
seq.186  rep52  428  GCT
seq.187  rep68  428  GCT
seq.188  rep40  428  GCT
seq.189  rep78  429  GCC
seq.190  rep52  429  GCC
seq.191  rep68  429  GCC
seq.192  rep40  429  GCC
seq.193  rep78  438  GCG
seq.194  rep52  438  GCG
seq.195  rep68  438  GCG
seq.196  rep40  438  GCG
seq.197  rep78  440  GCG
seq.198  rep52  440  GCG
seq.199  rep68  440  GCG
seq.200  rep40  440  GCG
seq.201  rep78  451  GCC
seq.202  rep52  451  GCC
seq.203  rep68  451  GCC
seq.204  rep40  451  GCC
seq.205  rep78  460  GCG
seq.206  rep52  460  GCG
seq.207  rep68  460  GCG
seq.208  rep40  460  GCG
seq.209  rep78  462  GCC
seq.210  rep52  462  GCC
seq.211  rep68  462  GCC
seq.212  rep40  462  GCC
seq.213  rep78  462  ATA
seq.214  rep52  462  ATA
seq.215  rep68  462  ATA
seq.216  rep40  462  ATA
seq.217  rep78  484  GCC
seq.218  rep52  484  GCC
seq.219  rep68  484  GCC
seq.220  rep40  484  GCC
seq.221  rep78  488  GCG
seq.222  rep52  488  GCG
seq.223  rep68  488  GCG
seq.224  rep40  488  GCG
seq.225  rep78  495  GCC
seq.226  rep52  495  GCC
seq.227  rep68  495  GCC
seq.228  rep40  495  GCC
seq.229  rep78  497  GCC
seq.230  rep52  497  GCC
seq.231  rep68  497  GCC
seq.232  rep40  497  GCC
seq.233  rep78  497  CGA
seq.234  rep52  497  CGA
seq.235  rep68  497  CGA
seq.236  rep40  497  CGA
seq.237  rep78  497  CTC
seq.238  rep52  497  CTC
seq.239  rep68  497  CTC
seq.240  rep40  497  CTC
seq.241  rep78  497  TAC
seq.242  rep52  497  TAC
seq.243  rep68  497  TAC
seq.244  rep40  497  TAC


WO 03/018820 



PCT/IB02/04087 



-62- 





seq.245  rep78  498  GCT
seq.246  rep52  498  GCT
seq.247  rep68  498  GCT
seq.248  rep40  498  GCT
seq.249  rep78  499  GCC
seq.250  rep52  499  GCC
seq.251  rep68  499  GCC
seq.252  rep40  499  GCC
seq.253  rep78  503  GCG
seq.254  rep52  503  GCG
seq.255  rep68  503  GCG
seq.256  rep40  503  GCG
seq.257  rep78  510  GCA
seq.258  rep52  510  GCA
seq.259  rep68  510  GCA
seq.260  rep40  510  GCA
seq.261  rep78  511  GCA
seq.262  rep52  511  GCA
seq.263  rep68  511  GCA
seq.264  rep40  511  GCA
seq.265  rep78  512  GCT
seq.266  rep52  512  GCT
seq.267  rep68  512  GCT
seq.268  rep40  512  GCT
seq.269  rep78  516  GCG
seq.270  rep52  516  GCG
seq.271  rep68  516  GCG
seq.272  rep40  516  GCG
seq.273  rep78  517  GCT
seq.274  rep52  517  GCT
seq.275  rep68  517  GCT
seq.276  rep40  517  GCT
seq.277  rep78  517  AAC
seq.278  rep52  517  AAC
seq.279  rep68  517  AAC
seq.280  rep40  517  AAC
seq.281  rep78  518  GCA
seq.282  rep52  518  GCA
seq.283  rep68  518  GCA
seq.284  rep40  518  GCA
seq.285  rep78  519  GCG
seq.286  rep52  519  GCG
seq.287  rep68  519  GCG
seq.288  rep40  519  GCG
seq.289  rep78  598  GCA
seq.290  rep52  598  GCA
seq.291  rep78  598  GAC
seq.292  rep52  598  GAC
seq.293  rep78  598  AGC
seq.294  rep52  598  AGC
seq.295  rep78  600  GCG






seq.296  rep52  600  GCG
seq.297  rep78  600  CCG
seq.298  rep52  600  CCG
seq.299  rep78  601  GCA
seq.300  rep52  601  GCA
seq.301  rep78  335 420 495  GCT GCC GCC
seq.302  rep52  335 420 495  GCT GCC GCC
seq.303  rep68  335 420 495  GCT GCC GCC
seq.304  rep40  335 420 495  GCT GCC GCC
seq.305  rep78  39 140  GCA GCC
seq.306  rep68  39 140  GCA GCC
seq.307  rep78  279 428 451  GCC GCT GCC
seq.308  rep52  279 428 451  GCC GCT GCC
seq.309  rep68  279 428 451  GCC GCT GCC
seq.310  rep40  279 428 451  GCC GCT GCC
seq.311  rep78  125 237 600  GCG GCC GCG
seq.312  rep52  125 237 600  GCG GCC GCG
seq.313  rep68  125 237 600  GCG GCC GCG
seq.314  rep40  125 237 600  GCG GCC GCG
seq.315  rep78  163 259  GCT GCG
seq.316  rep52  163 259  GCT GCG
seq.317  rep68  163 259  GCT GCG
seq.318  rep40  163 259  GCT GCG
seq.319  rep78  17 127 189  GCG GCT GCG
seq.320  rep68  17 127 189  GCG GCT GCG
seq.321  rep78  350 428  GCT GCT
seq.322  rep52  350 428  GCT GCT
seq.323  rep68  350 428  GCT GCT
seq.324  rep40  350 428  GCT GCT
seq.325  rep78  54 338 495  GCC GCC GCC
seq.326  rep52  54 338 495  GCC GCC GCC
seq.327  rep68  54 338 495  GCC GCC GCC
seq.328  rep40  54 338 495  GCC GCC GCC
seq.329  rep78  350 420  GCT GCC
seq.330  rep52  350 420  GCT GCC
seq.331  rep68  350 420  GCT GCC
seq.332  rep40  350 420  GCT GCC
seq.333  rep78  189 197 518  GCG GCG GCA
seq.334  rep52  189 197 518  GCG GCG GCA
seq.335  rep68  189 197 518  GCG GCG GCA
seq.336  rep40  189 197 518  GCG GCG GCA
seq.337  rep78  468 516  GCC GCG
seq.338  rep52  468 516  GCC GCG
seq.339  rep68  468 516  GCC GCG
seq.340  rep40  468 516  GCC GCG
seq.341  rep78  127 221 350 54 140  GCT GCA GCT GCC GCC
seq.342  rep52  127 221 350 54 140  GCT GCA GCT GCC GCC
seq.343  rep68  127 221 350 54 140  GCT GCA GCT GCC GCC
seq.344  rep40  127 221 350 54 140  GCT GCA GCT GCC GCC
seq.345  rep78  221 285  GCA GCG
seq.346  rep52  221 285  GCA GCG
seq.347  rep68  221 285  GCA GCG
seq.348  rep40  221 285  GCA GCG
seq.349  rep78  23 495  GCT GCC
seq.350  rep52  23 495  GCT GCC
seq.351  rep68  23 495  GCT GCC
seq.352  rep40  23 495  GCT GCC
seq.353  rep78  20 54 420 495  GCC GCC GCC GCC
seq.354  rep52  20 54 420 495  GCC GCC GCC GCC
seq.355  rep68  20 54 420 495  GCC GCC GCC GCC
seq.356  rep40  20 54 420 495  GCC GCC GCC GCC
seq.357  rep78  412 612  GCC GCG
seq.358  rep52  412 612  GCC GCG
seq.359  rep68  412 612  GCC GCG
seq.360  rep40  412 612  GCC GCG
seq.361  rep78  197 412  GCG GCC
seq.362  rep52  197 412  GCG GCC
seq.363  rep68  197 412  GCG GCC
seq.364  rep40  197 412  GCG GCC
seq.365  rep78  412 495 511  GCC GCC GCA
seq.366  rep52  412 495 511  GCC GCC GCA
seq.367  rep68  412 495 511  GCC GCC GCA
seq.368  rep40  412 495 511  GCC GCC GCA
seq.369  rep78  98 422  GCC GCC
seq.370  rep52  98 422  GCC GCC
seq.371  rep68  98 422  GCC GCC
seq.372  rep40  98 422  GCC GCC
seq.373  rep78  17 127 189  GCG GCT GCG
seq.374  rep68  17 127 189  GCG GCT GCG
seq.375  rep78  20 54 495  GCC GCC GCC
seq.376  rep52  20 54 495  GCC GCC GCC
seq.377  rep68  20 54 495  GCC GCC GCC
seq.378  rep40  20 54 495  GCC GCC GCC
seq.379  rep78  259 54  GCG GCC
seq.380  rep52  259 54  GCG GCC
seq.381  rep68  259 54  GCG GCC
seq.382  rep40  259 54  GCG GCC
seq.383  rep78  335 399  GCT GCG
seq.384  rep52  335 399  GCT GCG
seq.385  rep68  335 399  GCT GCG
seq.386  rep40  335 399  GCT GCG
seq.387  rep78  221 432  GCA GCA
seq.388  rep52  221 432  GCA GCA
seq.389  rep68  221 432  GCA GCA
seq.390  rep40  221 432  GCA GCA
seq.391  rep78  259 516  GCG GCG
seq.392  rep52  259 516  GCG GCG
seq.393  rep68  259 516  GCG GCG
seq.394  rep40  259 516  GCG GCG
seq.395  rep78  495 516  GCC GCG
seq.396  rep52  495 516  GCC GCG
seq.397  rep68  495 516  GCC GCG
seq.398  rep40  495 516  GCC GCG
seq.399  rep78  414 14  GCT GCC
seq.400  rep52  414 14  GCT GCC
seq.401  rep68  414 14  GCT GCC
seq.402  rep40  414 14  GCT GCC
seq.403  rep78  74 402 495  GCG GCC GCC
seq.404  rep52  74 402 495  GCG GCC GCC
seq.405  rep68  74 402 495  GCG GCC GCC
seq.406  rep40  74 402 495  GCG GCC GCC
seq.407  rep78  228 462 497  GCC GCC GCC
seq.408  rep52  228 462 497  GCC GCC GCC
seq.409  rep68  228 462 497  GCC GCC GCC
seq.410  rep40  228 462 497  GCC GCC GCC
seq.411  rep78  290 338  GCG GCC
seq.412  rep52  290 338  GCG GCC
seq.413  rep68  290 338  GCG GCC
seq.414  rep40  290 338  GCG GCC
seq.415  rep78  140 511  GCC GCA
seq.416  rep52  140 511  GCC GCA
seq.417  rep68  140 511  GCC GCA
seq.418  rep40  140 511  GCC GCA
seq.419  rep78  86 378  GCG GCG
seq.420  rep52  86 378  GCG GCG
seq.421  rep68  86 378  GCG GCG
seq.422  rep40  86 378  GCG GCG
seq.423  rep78  54 86  GCC GCG
seq.424  rep68  54 86  GCC GCG
seq.425  rep78  54 86  GCC GCG
seq.426  rep68  54 86  GCC GCG
seq.427  rep78  214 495 140  GCG GCC GCC
seq.428  rep52  214 495 140  GCG GCC GCC
seq.429  rep68  214 495 140  GCG GCC GCC
seq.430  rep40  214 495 140  GCG GCC GCC
seq.431  rep78  495 511  GCC GCA
seq.432  rep52  495 511  GCC GCA
seq.433  rep68  495 511  GCC GCA
seq.434  rep40  495 511  GCC GCA
seq.435  rep78  495 54  GCC GCC
seq.436  rep52  495 54  GCC GCC
seq.437  rep68  495 54  GCC GCC
seq.438  rep40  495 54  GCC GCC
seq.439  rep78  197 495  GCG GCC
seq.440  rep52  197 495  GCG GCC
seq.441  rep68  197 495  GCG GCC
seq.442  rep40  197 495  GCG GCC
seq.443  rep78  261 20  GCC GCC
seq.444  rep52  261 20  GCC GCC
seq.445  rep68  261 20  GCC GCC
seq.446  rep40  261 20  GCC GCC
seq.447  rep78  54 20  GCC GCC
seq.448  rep68  54 20  GCC GCC
seq.449  rep78  197 420  GCG GCC
seq.450  rep52  197 420  GCG GCC
seq.451  rep68  197 420  GCG GCC
seq.452  rep40  197 420  GCG GCC
seq.453  rep78  54 338 495  GCC GCC GCC
seq.454  rep52  54 338 495  GCC GCC GCC
seq.455  rep68  54 338 495  GCC GCC GCC
seq.456  rep40  54 338 495  GCC GCC GCC
seq.457  rep78  197 427  GCG GCG
seq.458  rep52  197 427  GCG GCG
seq.459  rep68  197 427  GCG GCG
seq.460  rep40  197 427  GCG GCG
seq.461  rep78  54 228 370 387  GCC GCC GCC GCG
seq.462  rep52  54 228 370 387  GCC GCC GCC GCG
seq.463  rep68  54 228 370 387  GCC GCC GCC GCG
seq.464  rep40  54 228 370 387  GCC GCC GCC GCG
seq.465  rep78  221 289  GCA GCC
seq.466  rep52  221 289  GCA GCC
seq.467  rep68  221 289  GCA GCC
seq.468  rep40  221 289  GCA GCC
seq.469  rep78  54 163  GCC GCT
seq.470  rep68  54 163  GCC GCT
seq.471  rep78  341 407 420  GCC GCC GCC
seq.472  rep52  341 407 420  GCC GCC GCC
seq.473  rep68  341 407 420  GCC GCC GCC
seq.474  rep40  341 407 420  GCC GCC GCC
seq.475  rep78  54 228  GCC GCC
seq.476  rep52  54 228  GCC GCC
seq.477  rep68  54 228  GCC GCC
seq.478  rep40  54 228  GCC GCC
seq.479  rep78  96 125 511  GCA GCG GCA
seq.480  rep52  96 125 511  GCA GCG GCA
seq.481  rep68  96 125 511  GCA GCG GCA
seq.482  rep40  96 125 511  GCA GCG GCA
seq.483  rep78  54 163  GCC GCT
seq.484  rep68  54 163  GCC GCT
seq.485  rep78  197 420  GCG GCC
seq.486  rep52  197 420  GCG GCC
seq.487  rep68  197 420  GCG GCC
seq.488  rep40  197 420  GCG GCC
seq.489  rep78  334 428 499  GCG GCT GCC
seq.490  rep52  334 428 499  GCG GCT GCC
seq.491  rep68  334 428 499  GCG GCT GCC
seq.492  rep40  334 428 499  GCG GCT GCC
seq.493  rep78  197 414  GCG GCT
seq.494  rep52  197 414  GCG GCT
seq.495  rep68  197 414  GCG GCT
seq.496  rep40  197 414  GCG GCT
seq.497  rep78  30 54 127  GCG GCC GCT
seq.498  rep68  30 54 127  GCG GCC GCT
seq.499  rep78  29 260  GCG GCG
seq.500  rep52  29 260  GCG GCG
seq.501  rep68  29 260  GCG GCG
seq.502  rep40  29 260  GCG GCG
seq.503  rep78  4 484  GCT GCC
seq.504  rep52  4 484  GCT GCC
seq.505  rep68  4 484  GCT GCC
seq.506  rep40  4 484  GCT GCC
seq.507  rep78  258 124 132  GCC GCC GCC
seq.508  rep52  258 124 132  GCC GCC GCC
seq.509  rep68  258 124 132  GCC GCC GCC
seq.510  rep40  258 124 132  GCC GCC GCC
seq.511  rep78  231 497  GCC GCC
seq.512  rep52  231 497  GCC GCC
seq.513  rep68  231 497  GCC GCC
seq.514  rep40  231 497  GCC GCC
seq.515  rep78  221 258  GCA GCC
seq.516  rep52  221 258  GCA GCC
seq.517  rep68  221 258  GCA GCC
seq.518  rep40  221 258  GCA GCC
seq.519  rep78  234 264 326  GCG GCG GCC
seq.520  rep52  234 264 326  GCG GCG GCC
seq.521  rep68  234 264 326  GCG GCG GCC
seq.522  rep40  234 264 326  GCG GCG GCC
seq.523  rep78  153 398  AGC GCG
seq.524  rep52  153 398  AGC GCG
seq.525  rep68  153 398  AGC GCG
seq.526  rep40  153 398  AGC GCG
seq.527  rep78  53 216  GCG GCC
seq.528  rep68  53 216  GCG GCC
seq.529  rep78  22 382  GCT GCG
seq.530  rep52  22 382  GCT GCG
seq.531  rep68  22 382  GCT GCG
seq.532  rep40  22 382  GCT GCG
seq.533  rep78  231 411  GCC GCA
seq.534  rep52  231 411  GCC GCA
seq.535  rep68  231 411  GCC GCA
seq.536  rep40  231 411  GCC GCA
seq.537  rep78  59 305  GCG GCC
seq.538  rep52  59 305  GCG GCC
seq.539  rep68  59 305  GCG GCC
seq.540  rep40  59 305  GCG GCC
seq.541  rep78  53 231  GCG GCC
seq.542  rep52  53 231  GCG GCC
seq.543  rep68  53 231  GCG GCC
seq.544  rep40  53 231  GCG GCC
seq.545  rep78  258 498  GCC GCT
seq.546  rep52  258 498  GCC GCT
seq.547  rep68  258 498  GCC GCT
seq.548  rep40  258 498  GCC GCT
seq.549  rep78  88 231  GCC GCC
seq.550  rep52  88 231  GCC GCC
seq.551  rep68  88 231  GCC GCC
seq.552  rep40  88 231  GCC GCC
seq.553  rep78  101 363  GCA GCC
seq.554  rep52  101 363  GCA GCC
seq.555  rep68  101 363  GCA GCC
seq.556  rep40  101 363  GCA GCC
seq.557  rep78  354 132  GCC GCC
seq.558  rep52  354 132  GCC GCC
seq.559  rep68  354 132  GCC GCC
seq.560  rep40  354 132  GCC GCC
seq.561  rep78  [illegible]  [illegible]
seq.562  rep68  [illegible]  [illegible]




DNA Sequences

Sequence  aa position  codon
seq.563  4  GCT
seq.564  10  GCG
seq.565  20  GCC
seq.566  22  GCT
seq.567  29  GCG
seq.568  38  GCG
seq.569  39  GCA
seq.570  53  GCT
seq.571  59  GCG
seq.572  64  GCT
seq.573  74  GCG
seq.574  86  GCG
seq.575  88  GCC
seq.576  101  GCA
seq.577  124  GCC
seq.578  125  GCG
seq.579  127  GCT
seq.580  132  GCC
seq.581  140  GCC
seq.582  161  GCC
seq.583  163  GCT
seq.584  175  GCT
seq.585  193  GCG
seq.586  196  GCC
seq.587  197  GCC
seq.588  221  GCA
seq.589  228 (Rep78/68)  GCG
         228 (Rep52)  GCG
         228 (Rep40)  GCG
seq.590  231 (Rep78/68)  GCC
         231 (Rep52)  GCC
         231 (Rep40)  GCC
seq.591  234 (Rep78/68)  GCG
         234 (Rep52)  GCG
         234 (Rep40)  GCG
seq.592  237 (Rep78/68)  GCC
         237 (Rep52)  GCC
         237 (Rep40)  GCC


seq.593  250 (Rep78/68)  GCC
         250  GCC
         250  GCC
seq.594  258 (Rep78/68)  GCC
         258  GCC
         258  GCC
seq.595  260 (Rep78/68)  GCG
         260  GCG
         260  GCG
seq.596  263 (Rep78/68)  GCC
         263  GCC
         263  GCC
seq.597  264 (Rep78/68)  GCG
         264  GCG
         264  GCG
seq.598  334 (Rep78/68)  GCG
         334  GCG
         334  GCG
seq.599  335 (Rep78/68)  GCT
         335  GCT
         335  GCT
seq.600  337 (Rep78/68)  GCT
         337  GCT
         337  GCT
seq.601  341 (Rep78/68)  GCC
         341  GCC
         341  GCC
seq.602  342 (Rep78/68)  GCC
         342  GCC
         342  GCC
seq.603  347 (Rep78/68)  GCA
         347  GCA
         347  GCA
seq.604  350 (Rep78/68)  AAT
         350  AAT
         350  AAT
seq.605  350 (Rep78/68)  GCT
         350  GCT
         350  GCT
seq.606  354 (Rep78/68)  GCC
         354  GCC
         354  GCC
seq.607  363 (Rep78/68)  GCC
         363  GCC
         363  GCC
seq.608  364 (Rep78/68)  GCT
         364  GCT
         364  GCT
seq.609  367 (Rep78/68)  GCC
         367  GCC
         367  GCC


seq.610  370 (Rep78/68)  GCC
         370  GCC
         370  GCC
seq.611  376 (Rep78/68)  GCG
         376  GCG
         376  GCG
seq.612  381 (Rep78/68)  GCG
         381  GCG
         381  GCG
seq.613  382 (Rep78/68)  GCG
         382  GCG
         382  GCG
seq.614  389 (Rep78/68)  GCG
         389  GCG
         389  GCG
seq.615  407 (Rep78/68)  GCC
         407  GCC
         407  GCC
seq.616  411 (Rep78/68)  GCA
         411  GCA
         411  GCA
seq.617  414 (Rep78/68)  GCT
         414  GCT
         414  GCT
seq.618  420 (Rep78/68)  GCT
         420  GCT
         420  GCT
seq.619  421 (Rep78/68)  GCC
         421  GCC
         421  GCC
seq.620  422 (Rep78/68)  GCC
         422  GCC
         422  GCC
seq.621  424 (Rep78/68)  GCG
         424  GCG
         424  GCG
seq.622  428 (Rep78/68)  GCT
         428  GCT
         428  GCT
seq.623  429 (Rep78/68)  GCC
         429  GCC
         429  GCC
seq.624  438 (Rep78/68)  GCG
         438  GCG
         438  GCG
seq.625  440 (Rep78/68)  GCG
         440  GCG
         440  GCG
seq.626  451 (Rep78/68)  GCC
         451  GCC
         451  GCC


seq.627  460 (Rep78/68)  GCG
         460  GCG
         460  GCG
seq.628  462 (Rep78/68)  GCC
         462  GCC
         462  GCC
seq.629  462 (Rep78/68)  ATA
         462  ATA
         462  ATA
seq.630  484 (Rep78/68)  GCC
         484  GCC
         484  GCC
seq.631  488 (Rep78/68)  GCG
         488  GCG
         488  GCG
seq.632  495 (Rep78/68)  GCC
         495  GCC
         495  GCC
seq.633  497 (Rep78/68)  GCC
         497  GCC
         497  GCC
seq.634  497 (Rep78/68)  CGA
         497  CGA
         497  CGA
seq.635  497 (Rep78/68)  CTC
         497  CTC
         497  CTC
seq.636  497 (Rep78/68)  TAC
         497  TAC
         497  TAC
seq.637  498 (Rep78/68)  GCT
         498  GCT
         498  GCT
seq.638  499 (Rep78/68)  GCC
         499  GCC
         499  GCC
seq.639  503 (Rep78/68)  GCG
         503  GCG
         503  GCG
seq.640  510 (Rep78/68)  GCA
         510  GCA
         510  GCA
seq.641  511 (Rep78/68)  GCA
         511  GCA
         511  GCA
seq.642  512 (Rep78/68)  GCT
         512  GCT
         512  GCT
seq.643  516 (Rep78/68)  GCG
         516  GCG
         516  GCG
seq.644  517 (Rep78/68)  GCT
         517  GCT
         517  GCT
seq.645  517 (Rep78/68)  AAC
         517  AAC
         517  AAC
seq.646  518 (Rep78/68)  GCA
         518  GCA
         518  GCA
seq.647  519 (Rep78/68)  GCG
         519  GCG
         519  GCG
seq.648  598 (Rep78/68)  GCA
seq.649  600 (Rep78/68)  GCG
seq.650  601 (Rep78/68)  GCA
seq.651  335 420 495  GCT GCC GCC
         335 420 495  GCT GCC GCC
         335 420 495  GCT GCC GCC
seq.652  39 140  GCA GCC
seq.653  279 428 451  GCC GCT GCC
         279 428 451  GCC GCT GCC
         279 428 451  GCC GCT GCC
seq.654  125 237 600  GCG GCC GCG
         125 237 600  GCG GCC GCG
         125 237 600  GCG GCC GCG
seq.655  163 259  GCT GCG
         163 259  GCT GCG
         163 259  GCT GCG
seq.656  17 127 189  GCG GCT GCG
seq.657  350 428  GCT GCT
         350 428  GCT GCT
         350 428  GCT GCT
seq.658  54 338 495  GCC GCC GCC
         54 338 495  GCC GCC GCC
         54 338 495  GCC GCC GCC
seq.659  350 420  GCT GCC
         350 420  GCT GCC
         350 420  GCT GCC
seq.660  189 197 518  GCG GCG GCA
         189 197 518  GCG GCG GCA
         189 197 518  GCG GCG GCA
seq.661  468 516  GCC GCG
         468 516  GCC GCG
         468 516  GCC GCG
seq.662  127 221 350 54 140  GCT GCA GCT GCC GCC
         127 221 350 54 140  GCT GCA GCT GCC GCC
         127 221 350 54 140  GCT GCA GCT GCC GCC
seq.663  221 285  GCA GCG
         221 285  GCA GCG
         221 285  GCA GCG


seq.664  23 495  GCT GCC
         23 495  GCT GCC
         23 495  GCT GCC
seq.665  20 54 420 495  GCC GCC GCC GCC
         20 54 420 495  GCC GCC GCC GCC
         20 54 420 495  GCC GCC GCC GCC
seq.666  412 612  GCC GCG
         412 612  GCC GCG
         412 612  GCC GCG
seq.667  197 412  GCG GCC
         197 412  GCG GCC
         197 412  GCG GCC
seq.668  412 495 511  GCC GCC GCA
         412 495 511  GCC GCC GCA
         412 495 511  GCC GCC GCA
seq.669  98 422  GCC GCC
         98 422  GCC GCC
         98 422  GCC GCC
seq.670  17 127 189  GCG GCT GCG
seq.671  20 54 495  GCC GCC GCC
         20 54 495  GCC GCC GCC
         20 54 495  GCC GCC GCC
seq.672  54 163  GCC GCT
seq.673  259 54  GCG GCC
         259 54  GCG GCC
         259 54  GCG GCC
seq.674  335 399  GCT GCG
         335 399  GCT GCG
         335 399  GCT GCG
seq.675  221 432  GCA GCA
         221 432  GCA GCA
         221 432  GCA GCA
seq.676  259 516  GCG GCG
         259 516  GCG GCG
         259 516  GCG GCG
seq.677  495 516  GCC GCG
         495 516  GCC GCG
         495 516  GCC GCG
seq.678  414 14  GCT GCC
         414 14  GCT GCC
         414 14  GCT GCC
seq.679  74 402 495  GCG GCC GCC
         74 402 495  GCG GCC GCC
         74 402 495  GCG GCC GCC
seq.680  228 462 497  GCC GCC GCC
         228 462 497  GCC GCC GCC
         228 462 497  GCC GCC GCC
seq.681  290 338  GCG GCC
         290 338  GCG GCC
         290 338  GCG GCC
seq.682  140 511  GCC GCA
         140 511  GCC GCA
         140 511  GCC GCA

seq.683 86 378 GCG GCG 

5 86 378 GCG GCG 

86 378 GCG GCG 

seq.684 54 86 GCC GCG 

54 86 GCC GCG 

54 86 GCC GCG 

10 seq.685 214 495 140 GCG GCC GCC 

214 495 140 GCG GCC GCC 

214 495 140 GCG GCC GCC 

seq.686 495 51 1 GCC GCA 

495 51 1 GCC GCA 

15 495 511 GCC GCA 

seq.687 495 54 GCC GCC 

495 54 GCC GCC 

495 54 GCC GCC 

seq.688 197 495 GCG GCC 

20 197 495 GCG GCC 

197 495 GCG GCC 

seq.689 261 20 GCC GCC 

261 20 GCC GCC 

261 20 GCC GCC 

25 seq.690 54 20 GCC GCC 

seq.691 197 420 GCG GCC 

197 420 GCG GCC 

197 420 GCG GCC 

seq.692 54 338 495 GCC GCC GCC 

30 54 338 495 GCC GCC GCC 

54 338 495 GCC GCC GCC 

seq.693" 197 427 GCG GCG 

197 427 GCG GCG 

197 427 GCG GCG 

35 seq.694 54 228 370 387 GCC GCC GCC GCG 

54 228 370 387 GCC GCC GCC GCG 

54 228 370 387 GCC GCC GCC GCG 

seq.695 221 289 GCA GCC 

221 289 GCA GCC 

40 221 289 GCA GCC 

seq.696 54 163 GCC GCT 

54 1 63 GCC GCT 

seq.697 341 407 420 GCC GCC GCC 

341 407 420 GCC GCC GCC 

45 341 407 420 GCC GCC GCC 

seq.698 54 228 GCC GCC 

54 228 GCC GCC 

54 228 GCC GCC 

seq.699 96 125 511 GCA GCG GCA 

50 96 1 25 51 1 GCA GCG GCA 

96 125 511 GCA GCG GCA 
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seq.700 197 420 GCG GCC 

197 420 GCG GCC 

197 420 GCG GCC 

seq.701 334 428 499 GCG GCT GCC 

5 334 428 499 GCG GCT GCC 

334 428 499 GCG GCT GCC 

seq.702 197 414 GCG GCT 

197 414 GCG GCT 

197 414 GCG GCT 

10 seq.703 30 54 127 GCG GCC GCT 

seq.704 29 260 GCG GCG 

29 260 GCG GCG 

29 260 GCG GCG 

seq.706 4 484 GCT GCC 

15 4 484 GCT GCC 

4 484 GCT GCC 

seq.707 258 124 132 GCC GCC GCC 

258 124 132 GCC GCC GCC 

258 124 132 GCC GCC GCC 

20 seq.708 231 497 GCC GCC 

231 497 GCC GCC 

231 497 GCC GCC 

seq.709 221 258 GCA GCC 

221 258 GCA GCC 

25 221 258 GCA GCC 

seq.710 234 264 326 GCG GCG GCC 

234 264 326 GCG GCG GCC 

234 264 326 GCG GCG GCC 

seq.711 153 398 AGC GCG 

30 153 398 AGC GCG 

1 53 398 AGC GCG 

seq.712" 53 216 GCG GCC 

seq.713 22 382 GCT GCG 

22 382 GCT GCG 

35 22 382 GCT GCG 

seq.714 231 411 GCC GCA 

231 411 GCC GCA 

231 41 1 GCC GCA 

seq.715 59 305 GCG GCC 

40 59 305 GCG GCC 

59 305 GCG GCC 

seq.716 53 231 GCG GCC 

53 231 GCG GCC 

53 231 GCG GCC 

45 seq.717 258 498 GCC GCT 

258 498 GCC GCT 

258 498 GCC GCT 

seq.718 88 231 GCC GCC 

88 231 GCC GCC 

50 88 231 GCC GCC 

seq.719 101 363 GCA GCC 



WO 03/018820 



PCT/IB02/04087 




101 363 GCA GCC 

101 363 GCA GCC 

seq.720 354 132 GCC GCC 

354 132 GCC GCC 

354 132 GCC GCC

seq.726 598 GAC 

seq.727 598 AGC 

seq.728 600 CCG 
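
As an illustration of how the rows of the above table can be read, the following Python sketch splits a row into its SEQ ID number, hit positions and replacement codons, and translates each codon. The row layout assumed here (identifier, then positions, then an equal number of codons) is inferred from the table itself; `parse_row`, `encoded_residues` and the abbreviated codon table are hypothetical names introduced only for this example.

```python
from typing import List, NamedTuple, Tuple

# Abbreviated codon table: only codons that occur in the table rows above.
CODON_TO_AA = {
    "GCA": "Ala", "GCC": "Ala", "GCG": "Ala", "GCT": "Ala",
    "AGC": "Ser", "GAC": "Asp", "CCG": "Pro",
}


class MutantRow(NamedTuple):
    seq_id: int
    positions: Tuple[int, ...]
    codons: Tuple[str, ...]


def parse_row(line: str) -> MutantRow:
    """Parse a row such as 'seq.692 54 338 495 GCC GCC GCC'.

    Assumed layout: 'seq.<number>', then N positions, then N codons.
    """
    fields = line.split()
    if not fields or not fields[0].startswith("seq."):
        raise ValueError("row must start with 'seq.<number>'")
    seq_id = int(fields[0][len("seq."):])
    rest = fields[1:]
    if not rest or len(rest) % 2 != 0:
        raise ValueError("expected equal numbers of positions and codons")
    half = len(rest) // 2
    positions = tuple(int(value) for value in rest[:half])
    codons = tuple(codon.upper() for codon in rest[half:])
    return MutantRow(seq_id, positions, codons)


def encoded_residues(row: MutantRow) -> List[Tuple[int, str]]:
    """Pair each hit position with the residue encoded by its new codon."""
    return [(pos, CODON_TO_AA[codon]) for pos, codon in zip(row.positions, row.codons)]
```

For example, `encoded_residues(parse_row("seq.726 598 GAC"))` pairs position 598 with Asp.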

The above nucleic acid molecules are provided in plasmids, which
are introduced into cells to produce the encoded proteins. The analysis
revealed the amino acid positions that affect Rep protein activities.
Changes of amino acids at any of the hit positions result in altered protein
activity. Hit positions are numbered and referenced starting from amino
acid 1 (nucleotide 321 in the AAV-2 genome), which is also codon 1 of the
Rep78 protein coding sequence under control of the p5 promoter of AAV-2:
4, 20, 22, 29, 32, 38, 39, 54, 59, 124, 125, 127, 132, 140, 161, 163, 193,
196, 197, 221, 228, 231, 234, 258, 260, 263, 264, 334, 335, 337,
342, 347, 350, 354, 363, 364, 367, 370, 376, 381, 389, 407, 411,
414, 420, 421, 422, 424, 428, 438, 440, 451, 460, 462, 484, 488,
495, 497, 498, 499, 503, 511, 512, 516, 517, 518, 542, 548, 598,
600 and 601. The encoded Rep78, Rep68, Rep52 and Rep40 proteins
and rAAV encoding the mutant proteins are provided. Also provided are
the corresponding nucleic acid molecules, Rep proteins, rAAV and cells
containing the nucleic acid molecules or rAAV in which the native
proteins are from other AAV serotypes, including, but not limited to,
AAV-1, AAV-3, AAV-3B, AAV-4, AAV-5 and AAV-6.
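
The numbering convention above can be sketched in Python as follows. The sketch assumes that the Rep78 codons run contiguously from nucleotide 321 of the AAV-2 genome, so that codon 1 spans nucleotides 321-323; `REP78_START_NT` and `codon_coordinates` are hypothetical names used only for illustration.

```python
from typing import Tuple

# First nucleotide of codon 1 of the Rep78 coding sequence (from the text).
REP78_START_NT = 321


def codon_coordinates(hit_position: int) -> Tuple[int, int]:
    """Return (first, last) genome nucleotide of the codon for a hit position.

    Assumes contiguous three-nucleotide codons starting at REP78_START_NT.
    """
    if hit_position < 1:
        raise ValueError("hit positions are numbered from 1")
    first = REP78_START_NT + 3 * (hit_position - 1)
    return first, first + 2
```

Under this assumption, hit position 4 corresponds to nucleotides 330-332.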

Other hit positions identified include: 10, 64, 74, 86, 88, 101, 
175, 237, 250, 334, 429 and 519. 

Also provided are nucleic acid molecules, the rAAV that encode
the mutant proteins, and the encoded proteins in which the native amino
acid at each hit position is replaced with another amino acid, or is
deleted, or additional amino acids are present at, adjacent to or near the
hit positions. In particular, provided are nucleic acid molecules and rAAV
that encode proteins containing the following amino acid replacements or
combinations thereof: T by N at hit position 350; T by I at hit position
462; P by R at hit position 497; P by L at hit position 497; P by Y at hit
position 497; T by N at hit position 517; L by S at hit position 542; R by
S at hit position 548; G by D at hit position 598; G by S at hit position
598; V by P at hit position 600. These replacements increase Rep protein
activities in terms of AAV or rAAV productivity. The corresponding
nucleic acid molecules, recombinant Rep proteins from the other
serotypes and the resulting rAAV are also provided (see Figs. 3 and the
above Table for the corresponding positions in AAV-1, AAV-3, AAV-3B,
AAV-4, AAV-5 and AAV-6).
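
The replacements listed above can be written compactly as native residue, hit position, new residue (for example `T350N`). The Python sketch below applies such replacements to a protein given in one-letter code and checks that the native residue is actually present at the stated position. The shorthand notation and the function name `apply_replacements` are assumptions made for this example, not part of the disclosure.

```python
import re
from typing import Iterable

# Replacements listed in the text, as <native><hit position><new residue>.
REPLACEMENTS = ["T350N", "T462I", "P497R", "P497L", "P497Y", "T517N",
                "L542S", "R548S", "G598D", "G598S", "V600P"]

_PATTERN = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def apply_replacements(protein: str, replacements: Iterable[str]) -> str:
    """Return a copy of `protein` with each replacement applied (1-based positions)."""
    residues = list(protein)
    for item in replacements:
        match = _PATTERN.match(item)
        if match is None:
            raise ValueError(f"cannot parse replacement {item!r}")
        native, new = match.group(1), match.group(3)
        position = int(match.group(2))
        if not 1 <= position <= len(residues):
            raise ValueError(f"position {position} is outside the protein")
        if residues[position - 1] != native:
            raise ValueError(f"expected {native} at position {position}")
        residues[position - 1] = new
    return "".join(residues)
```

Because the native residue is checked, two alternative replacements at the same position (such as `P497R` and `P497L`) cannot be applied to the same protein; they describe separate mutants.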

Mutant adeno-associated virus (AAV) Rep proteins and viruses
encoding such proteins that include mutations at one or more of residues
64, 74, 88, 175, 237, 250 and 429, where residue 1 corresponds to
residue 1 of the Rep78 protein encoded by nucleotides 321-323 of the
AAV-2 genome, and where the amino acids are replaced as follows: L by
A at position 64; P by A at position 74; Y by A at position 88; Y by A at
position 175; T by A at position 237; T by A at position 250; D by A at
position 429, are provided. Nucleic acid molecules encoding these
viruses and the mutant proteins are also provided.

Also provided are nucleic acid molecules produced from any of the
above-noted nucleic acid molecules by any directed evolution method,
including, but not limited to, re-synthesis, mutagenesis, recombination
and gene shuffling, and by combining the molecules in any combination,
i.e., one by one, two by two, up to n by n, where n is the
number of molecules to be combined (i.e., combining all together). The
resulting recombinant AAV and encoded proteins are also provided.
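
The combination scheme described above (singly, in pairs, and so on up to all n together) can be sketched in Python with `itertools.combinations`. The sketch treats each molecule as a single labeled replacement such as `T350N`; skipping combinations that hit the same position twice is an assumption added here, since two alternative replacements at one position cannot coexist in one protein. The function names are hypothetical.

```python
from itertools import combinations
from typing import Iterator, Sequence, Tuple


def position_of(mutation: str) -> int:
    """Extract the hit position from a label such as 'T350N'."""
    return int(mutation[1:-1])


def enumerate_combinations(mutations: Sequence[str]) -> Iterator[Tuple[str, ...]]:
    """Yield every combination of 1..n mutations with distinct positions."""
    for size in range(1, len(mutations) + 1):
        for combo in combinations(mutations, size):
            positions = [position_of(m) for m in combo]
            # Assumption: one replacement per position within a combination.
            if len(set(positions)) == len(positions):
                yield combo
```

For n mutations at distinct positions this yields 2^n - 1 combinations, which grows quickly; in practice a subset would presumably be selected for screening.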
Also provided are nucleic acid molecules encoding Rep proteins in which
additional amino acids surrounding each hit, such as one, two, three ...
ten or more amino acids, are systematically replaced, such that the
resulting Rep protein(s) has increased or decreased activity. Increased
activity as assessed by increased recombinant virus production in suitable
cells is of particular interest for production of recombinant viruses for
use, for example, in gene therapy.
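
As a small illustration of replacing residues surrounding a hit, the Python sketch below lists the positions within a chosen window on either side of a hit position, clipped to the ends of the protein. The window size and protein length are parameters supplied by the caller; the function name `surrounding_positions` is hypothetical.

```python
from typing import List


def surrounding_positions(hit: int, window: int, protein_length: int) -> List[int]:
    """Return the 1-based positions within `window` residues of `hit`, excluding `hit`."""
    if not 1 <= hit <= protein_length:
        raise ValueError("hit position is outside the protein")
    if window < 0:
        raise ValueError("window must be non-negative")
    first = max(1, hit - window)
    last = min(protein_length, hit + window)
    return [pos for pos in range(first, last + 1) if pos != hit]
```

Each returned position could then be replaced in turn and the resulting mutant screened for activity.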

Also provided are combinations of the above-noted mutants in
which several of the noted amino acids are changed and optionally
additional amino acids surrounding each hit, such as one, two, three ...
ten or more, are replaced.

For all of the mutant proteins provided herein, those with increased
activity, such as an increase in titer of rAAV when virus containing such
mutations and/or expressing such mutant proteins are replicated, are of
particular interest. Such mutations and proteins are provided herein and
may be made by the methods herein, including by combining any of the
mutations provided herein to produce additional mutant proteins that have
altered biological activity, particularly increased activity, compared to the
wild-type.

The nucleic acid molecules of SEQ ID Nos. 563-725 and the
encoded proteins (SEQ ID Nos. 1-562 and 726-728) are also provided.
Recombinant AAV and cells containing the encoding nucleic acids are
provided, as are the AAV produced upon replication of the AAV in the
cells.

Methods of in vivo or in vitro production of AAV or rAAV using
any of the above nucleic acid molecules or cells for intracellular
expression of Rep proteins or the rep gene mutants are provided. In vitro
production is effected using cell-free systems, expression or replication
and/or virus assembly. In vivo production is effected in mammalian cells
that also contain any requisite cis-acting elements required for packaging.

Also provided are nucleic acid molecules and rAAV (any serotype)
in which position 630 (or the corresponding position in another serotype;
see Figs. 3 and the table above) has been changed. Changes at this
position and the region around it lead to changes in the activity or in the
quantities of the Rep or Cap proteins and/or the amount of AAV or rAAV
produced in cells transduced with AAV encoding such mutants. Such
mutations include a tgc to gcg change (SEQ ID No. 721). Mutations at any
position surrounding codon position 630 that increase or decrease the
quantities or activities of the Rep or Cap proteins are also provided.
Methods using the rAAV (any serotype) that contain nucleic acid molecules
with a mutation at position 630 or within 1, 2, 3 ... 10 or more bases
thereof for the intracellular expression of Rep proteins or the rep gene
mutants covered by claims 10 to 13, for the production of AAV or rAAV
(either in vitro, in vivo or ex vivo) are provided. In vitro methods include
cell-free systems, expression or replication and/or virus assembly.

Also provided are rAAV (and other serotypes with corresponding
changes) and nucleic acid molecules encoding an amino acid replacement
by N at hit position 350 of AAV-1, AAV-3, AAV-3B, AAV-4 and AAV-6
or at hit position 346 of AAV-5; by I at hit position 462 of AAV-1,
AAV-3, AAV-3B, AAV-4 and AAV-6 or at hit position 458 of AAV-5; by
either R, L or Y at hit position 497 of AAV-1, AAV-3, AAV-3B, AAV-4
and AAV-6 or at hit position 493 of AAV-5; by N at hit position 517 of
AAV-1, AAV-3, AAV-3B, AAV-4 and AAV-6 or at hit position 535 of
AAV-5; by S at hit position 543 of AAV-1 and AAV-6 or at hit position
542 of AAV-3, AAV-3B and AAV-4 or at hit position 561 of AAV-5; by S
at hit position 549 of AAV-1 and AAV-6 or at hit position 548 of AAV-3,
AAV-3B and AAV-4 or at hit position 567 of AAV-5; by either D or S at
hit position 599 of AAV-1, AAV-4 and AAV-6 or at hit position 600 of
AAV-3 and AAV-3B; by P at hit position 602 of AAV-1, AAV-4 and
AAV-6 or at hit position 603 of AAV-3 and AAV-3B or at hit position 589
of AAV-5, in order to increase Rep protein activities as assessed by AAV
or rAAV productivity. Methods using such AAV for expression of the
encoded proteins and production of AAV are also provided.
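
The serotype correspondences stated in the preceding paragraph can be collected into a lookup table, sketched below in Python. Keying the table by the AAV-2 hit position is an inference from the order in which the paragraph lists the replacements; where the paragraph gives no position for a serotype, the lookup returns `None`. `CORRESPONDING` and `corresponding_position` are hypothetical names.

```python
from typing import Dict, Optional, Sequence, Tuple

_MOST = ("AAV-1", "AAV-3", "AAV-3B", "AAV-4", "AAV-6")


def _row(groups: Sequence[Tuple[Sequence[str], int]]) -> Dict[str, int]:
    """Expand (serotypes, position) groups into a serotype -> position map."""
    return {serotype: pos for serotypes, pos in groups for serotype in serotypes}


# AAV-2 hit position -> position in each other serotype, as stated in the text.
CORRESPONDING: Dict[int, Dict[str, int]] = {
    350: _row([(_MOST, 350), (("AAV-5",), 346)]),
    462: _row([(_MOST, 462), (("AAV-5",), 458)]),
    497: _row([(_MOST, 497), (("AAV-5",), 493)]),
    517: _row([(_MOST, 517), (("AAV-5",), 535)]),
    542: _row([(("AAV-1", "AAV-6"), 543),
               (("AAV-3", "AAV-3B", "AAV-4"), 542), (("AAV-5",), 561)]),
    548: _row([(("AAV-1", "AAV-6"), 549),
               (("AAV-3", "AAV-3B", "AAV-4"), 548), (("AAV-5",), 567)]),
    598: _row([(("AAV-1", "AAV-4", "AAV-6"), 599), (("AAV-3", "AAV-3B"), 600)]),
    600: _row([(("AAV-1", "AAV-4", "AAV-6"), 602),
               (("AAV-3", "AAV-3B"), 603), (("AAV-5",), 589)]),
}


def corresponding_position(aav2_position: int, serotype: str) -> Optional[int]:
    """Look up the position in `serotype` matching an AAV-2 hit position."""
    if aav2_position not in CORRESPONDING:
        return None
    if serotype == "AAV-2":
        return aav2_position
    return CORRESPONDING[aav2_position].get(serotype)
```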






Since modifications will be apparent to those of skill in this art, it is 
intended that this invention be limited only by the scope of the appended 
claims. 
WHAT IS CLAIMED IS:

1. An adeno-associated virus (AAV), comprising nucleic acid
encoding the sequence of amino acids in any of SEQ ID Nos. 1-562 and
726-728 or encoding a sequence of amino acids encoded by SEQ ID Nos.
722-725.

2. The AAV of claim 1, wherein the sequence of nucleotides
encoding the sequence of amino acids is set forth in SEQ ID Nos. 563-
725.

3. The AAV of claim 1 that has an altered activity in a Rep
protein and/or a capsid protein.

4. The AAV of claim 3, wherein the alteration leads to greater
activity in the Rep gene manifested as an increased titer of virus upon
introduction and replication in a host cell compared to the titer of virus
upon introduction and replication of a wild type Rep gene.

5. The AAV of claim 1 that is of serotype AAV-1, AAV-2,
AAV-3, AAV-3B, AAV-4, AAV-5 or AAV-6.

6. A mutant adeno-associated virus (AAV) Rep protein,
comprising mutations at one or more of residues 4, 20, 22, 29, 32, 38,
39, 54, 59, 124, 125, 127, 132, 140, 161, 163, 193, 196, 197, 221,
228, 231, 234, 258, 260, 263, 264, 334, 335, 337, 342, 347, 350,
354, 363, 364, 367, 370, 376, 381, 389, 407, 411, 414, 420, 421,
422, 424, 428, 438, 440, 451, 460, 462, 484, 488, 495, 497, 498,
499, 503, 511, 512, 516, 517, 518, 542, 548, 598, 600 and 601 of
AAV-2 or the corresponding residues in other serotypes, wherein residue
1 corresponds to residue 1 of the Rep78 protein encoded by nucleotides
321-323 of the AAV-2 genome, wherein the mutations comprise
insertions, deletions or replacements of the native amino acid residue(s).

7. The Rep protein of claim 6 that is Rep 78, Rep 68, Rep 52 or
Rep 40.
8. The mutant AAV Rep protein of claim 6, wherein the AAV is
an AAV-1, AAV-2, AAV-3, AAV-3B, AAV-4, AAV-5 or AAV-6, wherein
the mutation is in the equivalent position in each serotype, wherein the
listed residues are the positions in AAV-2.

9. A mutant AAV Rep protein of claim 6 that has increased
activity compared to the native protein, wherein activity is assessed by
measuring viral production when an AAV that encodes the protein is
introduced into a cell under conditions wherein the virus replicates.

10. A mutant AAV Rep protein of claim 6 that has decreased
activity compared to the native protein, wherein activity is assessed by
measuring viral production when an AAV that encodes the protein is
introduced into a cell under conditions wherein the virus replicates.

11. A mutant Rep protein of claim 6, further comprising a
mutation at one or more of residues 10, 64, 74, 86, 88, 101, 175, 237,
250, 334, 429 and 519.

12. The mutant Rep protein of claim 6, wherein the amino acids
are replaced as follows: T by N at position 350; T by I at position 462; P
by R at position 497; P by L at position 497; P by Y at position 497; T by
N at position 517; G by D at position 598; G by S at position 598; V by P
at position 600, whereby the activity of the Rep protein is increased as
assessed by rAAV production compared to the native Rep protein.

13. A mutant Rep protein of claim 6, comprising two or more of
the mutations.

14. A mutant adeno-associated virus (AAV) Rep protein,
comprising mutations at one or more of residues 64, 74, 88, 175, 237,
250 and 429, wherein: residue 1 corresponds to residue 1 of the Rep78
protein encoded by nucleotides 321-323 of the AAV-2 genome;
wherein the amino acids are replaced as follows:
L by A at position 64;
P by A at position 74;
Y by A at position 88;
Y by A at position 175;
T by A at position 237;
T by A at position 250;
D by A at position 429;
the mutations comprise insertions, deletions or replacements of the
native amino acid residue.

15. A nucleic acid molecule encoding the protein of claim 6.

16. A recombinant AAV comprising the nucleic acid molecule of
claim 15.

17. A eukaryotic cell, comprising the recombinant AAV of claim
16.

18. A collection of nucleic acid molecules comprising a plurality of
the molecules of claim 17.

19. A collection of nucleic acid molecules comprising a plurality of
the molecules of claim 15.

20. An isolated nucleic acid molecule encoding the proteins of
SEQ ID Nos. 1-562 and 726-728 or encoding a sequence of amino acids
encoded by SEQ ID Nos. 722-725.

21. A Rep protein of any of SEQ ID Nos. 1-562 and 726-728 or
encoding a sequence of amino acids encoded by SEQ ID Nos. 722-725.

22. A Rep protein encoded by any of SEQ ID Nos. 564-725.

23. A method for intracellular expression of a mutant Rep
protein, comprising:
introducing the recombinant AAV of claim 16 into a host
cell; and
culturing the cell, under conditions and in which the AAV
Rep proteins are expressed.
24. The method of claim 23, wherein the AAV replicates.

25. An AAV genome, comprising a mutation at one or more of
nucleotides corresponding to nucleotides 2209-2211 of the AAV-2
genome, which encode amino acid residue 630 of the Rep78 protein,
wherein:
the mutation is a deletion, insertion or replacement of a nucleotide;
and the mutation results in a change in the activity or in the
quantities of the Rep or Cap proteins as assessed by the level of
replication of the AAV genome.

26. The AAV genome of claim 25, wherein the mutation at
position 630 is a tgc to gcg and the intron comprises the sequence (SEQ
ID No. 722):

gtaccaaaacaaatgttctcgtcacgtgggcatgaatctgatgctgtttccctgc
agacaatgcgagagaatgaatcagaattcaaatatctgcttcactcacggacaga
aagactgtttagagtgctttcccgtgtcagaatctcaacccgtttctgtcgtcaa
aaaggcgtatcagaaactgtgctacattcatcatatcatgggaaaggtgccagac
gcttgcactgcctgcgatctggtcaatgtggatttggatgactgcatctttgaac
aataaatgatttaaatcaggtatggcgcgcgatggttatcttccag.

27. The AAV genome of claim 25, wherein the mutation at
position 630 is a tgc to cgc and the intron comprises the sequence (SEQ
ID No. 723):

gtaccaaaacaaatgttctcgtcacgtgggcatgaatctgatgctgtttcc
ctgcagacaatgcgagagaatgaatcagaattcaaatatctgcttcactcac
ggacagaaagactgtttagagtgctttcccgtgtcagaatctcaacccgtttc
tgtcgtcaaaaaggcgtatcagaaactgtgctacattcatcatatcatgggaa
aggtgccagacgcttgcactgcctgcgatctggtcaatgtggatttggatgac
tgcatctttgaacaataaatgatttaaatcaggtatggccgccgatggttatc
ttccag.
28. The AAV genome of claim 25, wherein the mutation at
position 630 is a tgc to get and the intron comprises the sequence (SEQ
ID No. 724):

gtaccaaaacaaatgttctcgtcacgtgggcatgaatctgatgctgttt
ccctgcagacaatgcgagagaatgaatcagaattcaaatatctgcttcac
tcacggacagaaagactgtttagagtgctttcccgtgtcagaatctcaac
ccgtttctgtcgtcaaaaaggcgtatcagaaactgtgctacattcatcat
atcatgggaaaggtgccagacgcttgcactgcctgcgatctggtcaatgt
ggatttggatgactgcatctttgaacaataaatgatttaaatcaggt
atggccctcgatggttatcttccag.

29. The AAV genome of claim 25, wherein the mutation at
position 630 is a tgc to tca and the intron comprises the sequence (SEQ
ID No. 725):

gtaccaaaacaaatgttctcgtcacgtgggcatgaatctgatgctgtttcc
ctgcagacaatgcgagagaatgaatcagaattcaaatatctgcttcactca
cggacagaaagactgtttagagtgctttcccgtgtcagaatctcaacccgt
ttctgtcgtcaaaaaggcgtatcagaaactgtgctacattcatcatatcat
gggaaaggtgccagacgcttgcactgcctgcgatctggtcaatgtggattt
ggatgactgcatctttgaacaataaatgatttaaatcaggtatggctcacg
atggttatcttccag.

30. A method for intracellular expression of a mutant Rep
protein, comprising:
introducing the recombinant AAV of claim 25 into a host
cell; and
culturing the cell, under conditions and in which the AAV
Rep proteins and/or cap proteins are expressed.

31. The method of claim 30, wherein the AAV replicates.

32. The AAV genome of claim 25, wherein the AAV is of
serotype AAV-1, AAV-3, AAV-3B, AAV-4, AAV-5 or AAV-6.
33. A method of titering virus by a method designated tagged
replication and expression enhancement, comprising:
(i) incubating host cells with a reporter virus vector
and with a titering virus of unknown titer, wherein a titering virus
increases or decreases the output signal from the reporter virus; and
(ii) measuring the output signal of the reporter virus
and determining the titer of the reporter virus; and
(iii) determining the titer of the titering virus by
comparing the titer of the reporter virus in the presence and absence of
the titering virus.

34. A process for the production of an adeno-associated virus
(AAV) protein or a recombinant AAV having a predetermined property,
comprising:
(a) producing a population of sets of nucleic acid molecules that
encode modified forms of a target protein;
(b) introducing each set of nucleic acid molecules into host cells
and expressing the encoded protein, wherein the host cells are present in
an addressable array;
(c) individually screening the sets of encoded proteins to identify
one or more proteins that have activity that differs from the target
protein, wherein each such protein is designated a hit;
(d) modifying the nucleic acid molecules that encode the hits, to
produce a set of nucleic acid molecules that encode modified hits,
wherein the nucleic acid molecules comprise rAAV vectors;
(e) introducing each set of nucleic acids that encode the
modified hits into cells; and
(f) individually screening the sets of cells that contain the nucleic
acid molecules that encode the modified hits to identify one or more cells
that encodes a protein that has activity that differs from the target protein
and has properties that differ from the original hits, wherein each such
protein is designated a lead.

35. The process of claim 34, wherein the cells are eukaryotic
cells that are transduced with the vectors.

36. The method of claim 35, wherein at step (f) the titer of the
viral vectors in each set of cells is determined.

37. The method of claim 36, wherein the target protein is a
protein involved in viral replication.

38. The method of claim 37, wherein the target protein is a Rep
protein.

39. The AAV mutant Rep protein of claim 6 that binds to a
sequence from a papillomavirus, oncogene or human immunodeficiency
virus (HIV) with different affinity from a wild-type AAV Rep protein.

40. A fusion protein, comprising the tat protein of HIV and the
mutant Rep protein of claim 39.

41. The fusion protein of claim 40, wherein the HIV is HIV-1.

42. A pharmaceutical composition, comprising the protein of
claim 39 in a pharmaceutically acceptable carrier.

43. A recombinant adeno-associated virus (rAAV) that encodes
a mutant Rep protein that has increased activity, wherein increased
activity of a Rep protein is manifested as an increased titer of virus upon
introduction and replication in a host cell compared to the titer of virus
upon introduction and replication of a wild type Rep gene.

44. A mutant AAV Rep protein that has increased activity,
wherein increased activity of a Rep protein is manifested as an increased
titer of virus upon introduction and replication in a host cell compared to
the titer of virus upon introduction and replication of a wild type Rep
gene.
45. A nucleic acid molecule that encodes the mutant Rep
protein of claim 44.

46. A cell, comprising the nucleic acid molecule of claim 45.

47. A rAAV, comprising the nucleic acid molecule of claim 45.

48. A cell, comprising the rAAV of claim 47.

49. A method for production of rAAV, comprising:
introducing the rAAV of claim 47 into a cell under conditions
whereby the virus replicates to produce encapsulated rAAV.

50. A method for the production of mutant Rep protein comprising
expressing the nucleic acid molecule of claim 45.

51. The method of claim 50, wherein expression is effected in
vivo.

52. The method of claim 50, wherein expression is effected in
vitro.

53. A method for producing Rep protein in a host cell, comprising:
expressing the protein encoded by the nucleic acid encoding
the protein of claim 44, wherein the method is performed in vitro or in
vivo.

54. The method of claim 53, wherein the nucleic acid is
introduced into a cell.

55. The method of claim 53, wherein expression is effected in a
cell-free system.

56. A method of treating or inhibiting infection by human
papilloma virus or a human immunodeficiency virus, comprising
administering, to a subject exposed to the virus or infected with the virus,
a composition containing a rAAV of claim 47.

57. A nucleic acid molecule encoding the protein of claim 7.

58. A nucleic acid molecule encoding the protein of claim 8.

59. A nucleic acid molecule encoding the protein of claim 9.
60. A nucleic acid molecule encoding the protein of claim 10.

61. A nucleic acid molecule encoding the protein of claim 11.

62. A nucleic acid molecule encoding the protein of claim 12.

63. A nucleic acid molecule encoding the protein of claim 13.

64. A nucleic acid molecule encoding the protein of claim 14.

65. A recombinant AAV comprising the nucleic acid molecule of
claim 57.

66. A recombinant AAV comprising the nucleic acid molecule of
claim 58.

67. A recombinant AAV comprising the nucleic acid molecule of
claim 59.

68. A recombinant AAV comprising the nucleic acid molecule of
claim 60.

69. A recombinant AAV comprising the nucleic acid molecule of
claim 61.

70. A recombinant AAV comprising the nucleic acid molecule of
claim 62.

71. A recombinant AAV comprising the nucleic acid molecule of
claim 63.

72. A recombinant AAV comprising the nucleic acid molecule of
claim 64.

73. A cell, comprising the recombinant AAV of claim 65.

74. A cell, comprising the recombinant AAV of claim 66.

75. A cell, comprising the recombinant AAV of claim 67.

76. A cell, comprising the recombinant AAV of claim 68.

77. A cell, comprising the recombinant AAV of claim 69.

78. A cell, comprising the recombinant AAV of claim 70.

79. A cell, comprising the recombinant AAV of claim 71.

80. A cell, comprising the recombinant AAV of claim 72.
81. A method for intracellular expression of a mutant Rep
protein, comprising:
culturing the cell of claim 73 under conditions and in which
the AAV Rep proteins are expressed.

82. The method of claim 81, wherein the AAV replicates.

83. A method for intracellular expression of a mutant Rep
protein, comprising culturing the cell of claim 74 under conditions in
which the AAV Rep proteins are expressed.

84. The method of claim 83, wherein the AAV replicates.

85. A method of altering expression of a gene, comprising
contacting the gene with a mutant rep protein that has increased activity,
wherein increased activity of a Rep protein is manifested as an increased
titer of virus upon introduction and replication in a host cell compared to
the titer of virus upon introduction and replication of a wild type Rep
gene.

86. The method of claim 85, wherein the gene is a viral gene.

87. The method of claim 85, wherein the gene is a cellular gene.

88. The mutant protein of claim 6, wherein the serotype is AAV-1,
AAV-2, AAV-3, AAV-3B, AAV-4, AAV-5 or AAV-6.

89. The protein of claim 44, wherein the mutation is at a residue
corresponding to one or more of residues 350, 462, 497, 517, 542, 548,
598, 600 and 630 of AAV-2.

90. The mutant protein of claim 89, wherein the serotype is AAV-1,
AAV-2, AAV-3, AAV-3B, AAV-4, AAV-5 or AAV-6.

91. The AAV mutant Rep protein of claim 44 that binds to a
sequence from a papillomavirus, oncogene or human immunodeficiency
virus (HIV) with different affinity from a wild-type AAV Rep protein.

92. A pharmaceutical composition, comprising the protein of
claim 91 in a pharmaceutically acceptable carrier.

93. A pharmaceutical composition, comprising the rAAV of claim
47 in a pharmaceutically acceptable carrier.
[Figure residue: multiple sequence alignment of Rep protein amino acid sequences (rows 1-7) with a consensus row (C), covering alignment columns from about 237 to about 624; the aligned residues are not legible in the extracted text.]
SEQUENCE LISTING

<110> Vega, Manuel 

Drittanti, Lila 
Plaux, Marjorie 

<120> MUTANT RECOMBINANT ADENO-ASSOCIATED VIRUSES

<130> 37851-912PC 

<140> To Be Assigned 
<141> Herewith

<150> 60/315,382 
<151> 2001-03-27 

<160> 735 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 4 GCT 



<400> 1

Thr Ala Gly Ala Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255
Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620

<210> 2 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 4 GCT
<400> 2 

Thr Ala Gly Ala Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 3 
<211> 621 
<212> PRT 

<213> Artificial Sequence 


<220> 

<223> Mutant rep protein: rep78 10 GCG 



<400> 3 
























Thr Ala Gly Phe Tyr Glu Ile Val Ile Ala Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620







<210> 4 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 10 GCG 



<400> 4 



Thr Ala Gly Phe Tyr Glu Ile Val Ile Ala Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 5 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 20 GCC 
<400> 5 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Ala Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620

<210> 6 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 20 GCC



<400> 6 






















Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Ala Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535



<210> 7 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 22 GCT 



<400> 7 
















Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ala Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620

<210> 8 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220>

<223> Mutant rep protein: rep68 22 GCT
<400> 8

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ala Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 9
<211> 621
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep78 29 GCG
<400> 9

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Ala Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620



<210> 10 
<211> 536 
<212> PRT 

<213> Artificial Sequence 

<220>

<223> Mutant rep protein: rep68 29 GCG
<400> 10

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Ala Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 11 
<211> 621
<212> PRT 

<213> Artificial Sequence 






<220> 

<223> Mutant rep protein: rep78 38 GCG



<400> 11 



Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Ala Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620

<210> 12 
<211> 536 
<212> PRT 

<213> Artificial Sequence 

<223> Mutant rep protein: rep68 38 GCG

<400> 12
Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Ala Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 13 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 39 GCA 
<400> 13 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Ala Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620



<210> 14 
<211> 536 
<212> PRT 
<213> Artificial Sequence
<220>
<223> Mutant rep protein: rep68 39 GCA
<400> 14
Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Ala Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 15 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 53 GCT 

<400> 15
Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Ala Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620



<210> 16 
<211> 536 
<212> PRT 

<213> Artificial Sequence 

<223> Mutant rep protein: rep68 53 GCT 

<400> 16
Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Ala Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 17 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 59 GCG 

<400> 17
Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Ala Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380






Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620

<210> 18
<211> 536
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep68 59 GCG

<400> 18
Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Ala Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 19 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 64 GCT 

<400> 19

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Ala
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620



<210> 20 
<211> 536 
<212> PRT 

<213> Artificial Sequence 

<220>
<223> Mutant rep protein: rep68 64 GCT 

<400> 20

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Ala
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 21 
<211> 621 
<212> PRT 

<213> Artificial Sequence 

<223> Mutant rep protein: rep78 74 GCG 



<400> 21

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Ala Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620









<210> 22 
<211> 536 
<212> PRT

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 74 GCG 
<400> 22 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Ala Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430
Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 23
<211> 621
<212> PRT
<213> Artificial Sequence
<220> 

<223> Mutant rep protein: rep78 86 GCG 



<400> 23 



Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Ala Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620



<210> 24 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 86 GCG 
<400> 24 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80






Gln Phe Glu Lys Gly Ala Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535



<210> 25 
<211> 621 
<212> PRT

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 88 GCC 
<400> 25 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Ala Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430
Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620



<210> 26 
<211> 536 
<212> PRT 
<213> Artificial Sequence



<220> 

<223> Mutant rep protein: rep68 88 GCC



<400> 26 



Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Ala Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 27 
<211> 621
<212> PRT 

<213> Artificial Sequence 



<223> Mutant rep protein: rep78 101 GCA

Thr°Ala 7 Gly Phe Tyr Glu He Val He Lys Val Pro Ser Asp Leu Asp 
Glu His Leu Pro Gly He Ser Asp Ser Phe Val Asn Trp Val Ala Glu 
Ly s Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu He 
Glu Gin aL Pro Leu Thr Val Ala Glu Lys Leu Gin Arg Asp Phe Leu 
Thr Slu Trp Arg Arg Val ler Lys Ala Pro Glu Ala Leu Phe Phe Val 
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Gin 


Phe 


Glu Lys 


Gly Glu Ser Tyr Phe His Met His 


Val 


Leu Val 


Glu 






85 




90 




95 




Thr Thr Gly Val 


Ala 


Ser Met 


Val Leu Gly Arg Phe 


Leu 


Ser Gin 


He 






100 






105 




110 




Arg 


Glu 


Lys Leu 


He 


Gin Arg 


He Tyr Arg Gly He 


Glu 


Pro Thr 


Leu 




115 






120 


125 






Pro 


Asn 


Trp Phe 


Ala 


Val Thr Lys Thr Arg Asn Gly 


Ala Gly Gly 


Gly 




130 




135 


140 








Asn Lys 


Val Val 


Asp 


Glu Cys 


Tyr He Pro Asn Tyr 


Leu 


Leu Pro 


Lys 


145 








150 


155 






160 


Thr 


Gin 


Pro Glu 


Leu 


Gin Trp 


Ala Trp Thr Asn Met 


Glu Gin Tyr 


Leu 








165 




170 




175 




Ser 


Ala 


Cys Leu 


Asn Leu Thr Glu Arg Lys Arg Leu 


Val 


Ala Gin 


His 






180 






185 




190 




Leu 


Thr 


His Val 


Ser 


Gin Thr 


Gin Glu Gin Asn Lys 


Glu 


Asn Gin 


Asn 






195 






200 


205 






Pro 


Asn 


Ser Asp 


Ala 


Pro Val 


lie Arg Ser Lys Thr 


Ser 


Ala Arg 


Tyr 




210 




215 


220 








Met 


Glu 


Leu Val 


Gly Trp Leu Val Asp Lys Gly He 


Thr 


Ser Glu 


Lys 


225 








230 


235 






240 


Gin 


Trp 


He Gin 


Glu Asp Gin Ala Ser Tyr He Ser 


Phe 


Asn Ala 


Ala 






245 




250 




255 




Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp

355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala

370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln

450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu

530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
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ses 570 575 

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr

580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp

595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln

610 615 620

<210> 28 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 101 GCA 
<400> 28 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp

1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu

20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile

35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu

50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu

85 90 95

Thr Thr Gly Val Ala Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile

100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu

115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly

130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu

165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His

180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn

195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr

210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

340 345 350
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Phe 


Tyr 


Gly Cys Val 


Asn 


Trp 


Thr 


Asn 




355 






360 




Cys 


Val 


Asp Lys Met 


Val 


He 


Trp 


Trp 


370 






375 






Lys 


Val 


Val Glu Ser 


Ala 


Lys 


Ala 


He 


385 






390 








Val 


Asp 


Gin Lys Cys 


Lys 


Ser 


Ser 


Ala 






405 










He 


Val 


Thr Ser Asn 


Thr 


Asn 


Met 


Cys 






420 








425 


Thr 


Thr 


Phe Glu His 


Gin 


Gin 


Pro 


Leu 






435 






440 




Glu 


Leu 


Thr Arg Arg Leu Asp 


His 


Asp 




450 






455 






Glu 


Val 


Lys Asp Phe 


Phe Arg Trp Ala 


465 






470 








Glu 


His 


Glu Phe Tyr 


Val 


Lys 


Lys 


Gly 






485 










Pro 


Ser 


Asp Ala Asp 


He 


Ser 


Glu 


Pro 






500 








505 


Ala 


Gin 


Pro Ser Thr Ser Asp Ala 


Glu 






515 






520 




Arg 


Leu 


Ala Arg Gly His 


Ser 


Leu 




530 






535 







<210> 29 

<211> 621 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 124 



<400> 29 



Thr 


Ala 


Gly 


Phe 


Tyr 
5 


Glu 


He 


Val 


He 


1 
Glu 


His 


Leu 


Pro 


Gly 


He 


Ser 


Asp 


Ser 








20 










25 


Lys 


Glu 


Trp 


Glu 


Leu 


Pro 


Pro 


Asp 


Ser 




35 










40 




Glu 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 


Ala 


Glu 




50 










55 






Thr 


Glu 


Trp 


Arg 


Arg 


Val 


Ser 


Lys 


Ala 


65 










70 








Gin 


Phe 


Glu 


Lys 


Gly 


Glu 


Ser 


Tyr 


Phe 










85 










Thr 


Thr 


Gly 


Val 


Lys 


Ser 


Met 


Val 


Leu 






100 










105 


Arg 


Glu 


Lys 


Leu 


He 


Gin 


Arg 


He 


Tyr 




115 










120 




Pro 


Asn 


Trp 


Phe 


Ala 


Val 


Thr 


Lys 


Thr 




130 










135 






Asn 


Lys 


Val 


Val 


Asp 


Glu 


Cys 


Tyr 


He 


145 










150 








Thr 


Gin 


Pro 


Glu 


Leu 


Gin 


Trp 


Ala 


Trp 










165 










Ser 


Ala 


Cys 


Leu 


Asn 


Leu 


Thr 


Glu Arg 






180 










185 


Leu 


Thr 


His 


Val 


Ser 


Gin 


Thr 


Gin 


Glu 






195 










200 




Pro 


Asn 


Ser 


Asp 


Ala 


Pro 


Val 


lie Arg 
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Glu Asn 


Phe 


Pro 


Phe 


Asn 


Asp 






365 








Glu Glu 


Gly Lys 


Met 


Thr 


Ala 




380 










Leu Glv 


Gly Ser Lys 


Val 


Arg 


395 










400 


Gin He 


Asp 


Pro 


Thr 


Pro 


Val 


410 








415 




Ala Val 


He Asp Gly Asn 


Ser 








430 






Gin Asp 


Arg Met 


Phe 


Lys 


Phe 






445 








Phe Gly 


Lys 


Val 


Thr 


Lys 


Gin 




460 










Lys Asp 


His 


Val 


Val 


Glu 


Val 


475 










480 


Gly Ala 


Lys 


Lys 


Arg 


Pro 


Ala 


490 








495 




Lys Arg 


Val 


Arg 


Glu 


Ser 


Val 






510 






Ala Ser 


He 


Asn 


Tyr 


Ala 


Asp 




525 








GCC 












Lys Val 


Pro 


Ser 


Asp 


Leu 


Asp 


10 








15 




Phe Val 


Asn Trp 


Val 


Ala 


Glu 








30 






Asp Met 


Asp 


Leu 


Asn 


Leu 


He 






45 








Lys Leu 


Gin Arg Asp 


Phe 


Leu 


60 










Pro Glu 


Ala 


Leu 


Phe 


Phe 


Val 


75 










80 


His Met 


His 


Val 


Leu 


Val 


Glu 


90 








95 




Gly Arg 


Phe 


Leu 


Ser 


Gin 


He 






110 






Arg Gly 


Ala 


Glu 


Pro 


Thr 


Leu 




125 








Arg Asn 


Gly Ala Gly Gly 


Gly 


140 










Pro Asn 


Tyr Leu Leu 


Pro 


Lys 


155 










160 


Thr Asn 


Met 


Glu 


Gin 


Tyr 


Leu 


170 








175 




Lys Arg 


Leu 


Val 


Ala 


Gin 


His 






190 






Gin Asn 


Lys 


Glu 


Asn 


Gin 


Asn 






205 








Ser Lys 


Thr 


Ser 


Ala 


Arg 


Tyr 
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210 








215 






Met 


Glu 


Leu 


Val Gly 


Trp 


Leu 


Val 


Asp 


225 








230 








Gin 


Trp 


lie 


Gin Glu 


Asp 


Gin 


Ala 


Ser 






245 










Ser 


Asn 


Ser Arg Ser 


Gin 


He 


Lys 


Ala 








260 








265 


He 


Met 


Ser 


Leu Thr 


Lys 


Thr 


Ala 


Pro 






275 








280 




Pro 


Val 


Glu Asp He 


Ser 


Ser 


Asn 


Arg 




290 








295 






Asn Gly 


Tyr Asp Pro 


Gin 


Tvr 


Ala 


Ala 


"3 ft C 








310 








TilX 


Lys 


Lys 


Phe Gly 


Lvs 


Arcr 


Asn 


Thr 








325 










Tnr 


Tnr 


Gly Lys Thr 




He 


Ala 


Glu 








340 








345 


Pne 


Tyr 


Gly 


Cys Val 


Asn 


Tro 

XT 


Thr 


Asn 






355 








360 




Cys 


Val 


Asp 


Lys Met 


Val 


He 


Tro 


Trr> 




370 








375 






Lys 


Val 


Val 


Glu Ser 


Ala 


Lvs 


Ala 


He 


385 








390 








Val 


Asp 


Gin 


Lys Cys 


Lvs 


Ser 


Ser 


Ala 








405 










He 


Val 


Thr 


Ser Asn 


Thr" 

_L Ail 


Asn 


Met 


Cvs 








420 








425 


Thr 


Thr 


Phe 


Glu His 


Gin 


Gin 


Pro 


Leu 






435 








440 




Glu 


Leu 


Thr 


Arg Arg 


Leu 


Asp 


His 


Asp 




450 








455 






Glu 


Val 


Lys 


Asp Phe 


Phe 


Arg 


Trp 


Ala 


465 








470 








Glu 


His 


Glu 


Phe Tyr 


Val 


Lys 


Lys 


Gly 








485 










Pro 


Ser 


Asp 


Ala Asp 


He 


Ser 


Glu 


Pro 






500 








505 


Ala 


Gin 


Pro 


Ser Thr 


Ser 


Asp 


Ala 


Glu 






515 








520 




Arg 


Tyr 


Gin 


Asn Lys 


Cys 


Ser 


Arg 


His 




530 








535 






Phe 


Pro 


Cys 


Arg Gin 


Cys 


Glu 


Arg 


Met 


545 








550 








Phe 


Thr 


His 


Gly Gin 


Lys 


Asp 


Cys 


Leu 








565 










Ser 


Gin 


Pro 


Val Ser 


Val 


val 


Lys 


Lys 








580 








585 


He 


His 


His 


He Met 


Gly 


Lys 


Val 


Pro 






595 








600 




Leu 


Val 


Asn Val Asp 


Leu 


Asp 


Asp 


Cys 




610 








615 







<210> 30 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 124 
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220 



Lys 


Gly 


He 


Thr 


Ser 


Glu 


Lys 




235 










240 


Tyr 


He 


Ser 


Phe 


Asn 


Ala 


Ala 


250 










255 




Ala 


Leu 


Asp 


Asn 


Ala 


Gly 


Lys 










270 






Asp 


Tyr 


Leu 


Val 


Gly 


Gin 


Gin 








285 








He 


Tyr 


Lys 


He 


Leu 


Glu 


Leu 






300 










Ser 


Val 


Phe 


Leu 


Gly 


Trp 


Ala 




315 










320 


He 


Trp 


Leu 


Phe 


Gly 


Pro 


Ala 


330 










335 




Ala 


He 


Ala 


His 


Thr 


Val 


Pro 










350 






Glu 


Asn 


Phe 


Pro 


Phe 


Asn 


Asp 








365 








Glu 


Glu 


Gly 


Lys 


Met 


Thr 


Ala 






380 










Leu 


Gly 


Gly 


Ser 


Lys 


Val 


Arg 




395 










400 


Gin 


He 


Asp 


Pro 


Thr 


Pro 


Val 


410 








415 




Ala 


Val 


He 


Asp 


Gly 


Asn 


Ser 










430 






Gin 


Asp 


Arg 


Met 


Phe 


Lys 


Phe 








445 








Phe 


Gly 


Lys 


Val 


Thr 


Lys 


Gin 






460 










Lys 


Asp 


His 


Val 


Val 


Glu 


Val 


475 










480 


Gly 


Ala 


Lys 


Lys 


Arg 


Pro 


Ala 


490 










495 




Lys 


Arg 


Val 


Arg 


Glu 


Ser 


Val 










510 






Ala 


Ser 


He 


Asn 


Tyr 


Ala 


Asp 








525 








Val 


Gly 


Met 


Asn 


Leu 


Met 


Leu 




540 










Asn 


Gin 


Asn 


Ser 


Asn 


He 


Cys 




555 










560 


Glu 


Cys 


Phe 


Pro 


Val 


Ser 


Glu 


570 








575 




Ala 


Tyr 


Gin 


Lys 


Leu 


Cys 


Tyr 










590 






Asp 


Ala 


Cys 


Thr 


Ala 


Cys 


Asp 








605 








He 


Phe 


Glu 


Gin 









620 



GCC 



<400> 30 
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Thr Ala Gly Phe Tyr Glu lie Val lie Lys Val Pro Ser Asp Leu Asp 

1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu

20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile

35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu

50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu

85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile

100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ala Glu Pro Thr Leu

115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly

130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu

165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His

180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn

195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr

210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp

355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala

370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln

450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
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485 

Pro Ser Asp Ala Asp Ile Ser Glu

500

Ala Gln Pro Ser Thr Ser Asp Ala
515 520 
Arg Leu Ala Arg Gly His Ser Leu 
530 535 

<210> 31 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 125 GCG 



<400> 31 



Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp

1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu

20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile

35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu

50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu

85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile

100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Ala Pro Thr Leu

115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly

130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu

165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His

180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn

195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr

210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

340 345 350

490 495

Pro Lys Arg Val Arg Glu Ser Val

505 510

Glu Ala Ser Ile Asn Tyr Ala Asp
525
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Phe 


Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp

355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala

370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln

450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu

530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu

565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr

580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp

595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln

610 615 620

<210> 32
<211> 536
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep68 125 GCG

<400> 32

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp

1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu

20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile

35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu

50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu

85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile

100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Ala Pro Thr Leu

115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
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130 








135 




Asn 


Lys 


Val Val 


Asp 


Glu 


Cys 


Tyr 


145 








150 






Thr 


Gin 


Pro Glu 


Leu 


Gin 


Trp 


Ala 








165 






Ser 


Ala 


Cys Leu 


Asn 


Leu 


Thr 


Glu 






180 










Leu 


Thr 


His Val 


Ser 


Gin 


Thr 


Gin 






195 








200 


Pro 


Asn 


Ser Asp 


Ala 


Pro 


Val 


He 




210 








215 




Met 


Glu 


Leu Val 


Glv 


Tro 

tr 


Leu 


Val 


225 








230 






Gin 


Trp 


He Gin 


Glu 


Asp 


Gin 


Ala 






245 








Ser 


Asn 


Ser Arg 


Ser 

**** ^» 


Gin 


He 


Lys 






260 










lie 


Met 


Ser Leu 


Thr 


Lvs 


Thr 


Ala 






275 








280 


Pro 


Val 


Glu Asp 


He 


Ser 


Ser 


Asn 




290 






295 




Asn Gly Tyr Asp 


Pro 


Gin 


Tyr 


Ala 


305 








310 






Thr 


Lys 


Lys Phe 


Glv 


Lys 


Arg 


Asn 








325 








Thr 


Thr 


Gly Lys 


Thr 


Asn 


He 


Ala 






340 










Phe Tyr Gly Cys 


Val 


Asn 


Trp 


Thr 






355 








360 


Cys 


Val 


Asp Lys 


Met 


Val 


He 


Trp 

tr 




370 








375 




Lys 


Val 


Val Glu 


Ser 


Ala 


Lys 


Ala 


385 








390 






Val 


Asp 


Gin Lys 


Cys 


Lys 


Ser 


Ser 








405 








He 


Val 


Thr Ser 


Asn 


Thr 


Asn 


Met 






420 










Thr 


Thr 


Phe Glu 


His 


Gin 


Gin 


Pro 






435 - 








440 


Glu 


Leu Thr Arg 


Arg 


Leu Asp 


His 




450 








455 




Glu 


Val 


Lys Asp 


Phe 


Phe Arg 


Trp 


465 








470 






Glu 


His 


Glu Phe 


Tyr 


Val 


Lys 


Lys 








485 








Pro 


Ser 


Asp Ala 


Asp 


He 


Ser 


Glu 






500 










Ala 


Gin 


Pro Ser 


Thr 


Ser Asp 


Ala 






515 








520 


Arg Leu Ala Arg Gly 


His 


Ser 


Leu 




530 








535 
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<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 
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140 



He 


Pro 


Asn 


Tyr 


Leu 


Leu 


Pro 


Lys 






155 










160 


Trp 


Thr 


Asn 


Met 


Glu 


Gin 


Tyr 


Leu 




170 










175 




Arg 


Lys 


Arg 


Leu 


Val 


Ala 


Gin 


His 


185 










190 






Glu 


Gin 


Asn 


Lys 


Glu 


Asn 


Gin 


Asn 








205 








Arg 


Ser 


Lys 


Thr 


Ser 


Ala 


Arg 


Tyr 








220 










Asp 


Lys 


Gly 


He 


Thr 


Ser 


Glu 


Lys 






235 










240 


Ser 


Tyr 


He 


Ser 


Phe 


Asn 


Ala 


Ala 




250 










255 




Ala 


Ala 


Leu 


Asp Asn 


Ala Gly 


Lys 


265 










270 






Pro 


Asp 


Tyr 


Leu 


Val 


Gly Gin 


Gin 










285 








Arg 


He 


Tyr 


Lys 


He 


Leu 


Glu 


Leu 








300 










Ala 


Ser 


Val 


Phe 


Leu 


Gly Trp 


Ala . 






315 










320 


Thr 


He 


Trp 


Leu 


Phe 


Gly 


Pro 


Ala 




330 








335 




Glu 


Ala 


He 


Ala 


His 


Thr 


Val 


Pro 


345 










350 






Asn 


Glu 


Asn 


Phe 


Pro 


Phe 


Asn 


Asp 










365 






Trp 


Glu 


Glu 


Gly 


Lys 


Met 


Thr 


Ala 








380 










He 


Leu Gly 


Gly 


Ser 


Lys 


Val 


Arg 






395 










400 


Ala 


Gin 


He 


Asp 


Pro 


Thr 


Pro 


Val 




410 










415 




Cys 


Ala 


Val 


He 


Asp 


Gly Asn 


Ser 


425 










430 






Leu 


Gin 


Asp 


Arg 


Met 


Phe 


Lys 


Phe 










445 








Asp 


Phe Gly 


Lys 


Val 


Thr Lys 


Gin 








460 










Ala 


Lys 


Asp 


His 


Val 


Val 


Glu 


Val 






475 










480 


Gly 


Gly Ala 


Lys 


Lys 


Arg 


Pro 


Ala 




490 










495 




Pro 


Lys 


Arg 


Val 


Arg 


Glu 


Ser 


Val 


505 










510 






Glu 


Ala 


Ser 


He 


Asn 


Tyr Ala 


Asp 










525 









127 GCT 



<400> 33 
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Thr Ala Gly Phe Tyr Glu He Val He Lys Val Pro Ser Asp Leu Asp 

1 5 10 15 

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu

20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile

35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu

50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu

85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile

100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Ala Leu

115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly

130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu

165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His

180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn

195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr

210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp

355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala

370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln

450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
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485 






Pro 


Ser 


Asp Ala Asp 


He 


Ser 






500 








Ala 


Gin 


Pro Ser 


Thr 


Ser Asp 






515 








Arg 


Tyr 


Gin Asn 


Lys 


Cys 


Ser 


530 








535 


Phe 


Pro 


Cys Arg 


Gin 


Cys 


Glu 


545 






550 




Phe 


Thr 


His Gly 


Gin 


Lys 


Asp 








565 






Ser 


Gin 


Pro Val 


Ser 


Val 


Val 






580 








He 


His 


His He 


Met 


Gly Lys 






595 








Leu 


Val 


Asn Val 


Asp 


Leu 


Asp 




610 








615 
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490 495 
Glu Pro Lys Arg Val Arg Glu Ser Val 

505 510 
Ala Glu Ala Ser Ile Asn Tyr Ala Asp
520 525
Arg His Val Gly Met Asn Leu Met Leu

540

Arg Met Asn Gln Asn Ser Asn Ile Cys

555 560
Cys Leu Glu Cys Phe Pro Val Ser Glu

570 575
Lys Lys Ala Tyr Gln Lys Leu Cys Tyr

585 590
Val Pro Asp Ala Cys Thr Ala Cys Asp
600 605
Asp Cys Ile Phe Glu Gln

620



<210> 34 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 127 GCT 



<400> 34 












He 




Thr 


Ala 


Gly 


Phe 


Tyr 


Glu 


He 


Val 


Lys 


1 






5 










10 


Glu 


His 


Leu 


Pro 


Gly 


He 


Ser 


Asp 


Ser 


Phe 








20 








25 




Lys 


Glu 


Trp 


Glu 


Leu 


Pro 


Pro 


Asp 


Ser 


Asp 




35 










40 






Glu 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 


Ala 


Glu 


Lys 




50 










55 




Ala 




Thr 


Glu 


Trp 


Arg 


Arg 


Val 


Ser 


Lys 


Pro 


65 








70 








His 
90 


Gin 


Phe 


Glu 


Lys 


Gly 
85 


Glu 


Ser 


Tyr 


Phe 


Thr 


Thr Gly 


Val 


Lys 


Ser 


Met 


Val 


Leu Gly 








100 










105 




Arg 


Glu 


Lys 


Leu 


He 


Gin 


Arg 


He Tyr Arg 




115 










120 






Pro 


Asn 


Trp 


Phe 


Ala 


Val 


Thr 


Lys 


Thr Arg 




130 








135 








Asn 


Lys 


Val 


Val 


Asp 


Glu 


Cys 


Tyr 


He 


Pro 


145 








150 








Thr 


Thr 


Gin 


Pro 


Glu 


Leu 


Gin 


Trp 


Ala Trp 








165 










170 


Ser 


Ala 


Cys 


Leu 


Asn 


Leu 


Thr 


Glu Arg 


Lys 






180 










185 




Leu 


Thr 


His 
195 


Val 


Ser 


Gin 


Thr 


Gin 
200 


Glu 


Gin 


Pro 


Asn 


Ser 


Asp 


Ala 


Pro 


Val 


He Arg 


Ser 




210 








215 


Val 






Met 


Glu 


Leu 


Val 


Gly 


Trp 


Leu 


Asp 


Lys 


225 










230 




Ala 






Gin 


Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ser 


Tyr 








245 










250 


Ser 


Asn 


Ser 


Arg 


Ser 


Gin 


He 


Lys 


Ala Ala 






260 










265 





15 

Val Asn Trp Val Ala Glu 

30 

Met Asp Leu Asn Leu Ile
45

Leu Gln Arg Asp Phe Leu
60

Glu Ala Leu Phe Phe Val
75 80
Met His Val Leu Val Glu

95

Arg Phe Leu Ser Gln Ile

110

Gly Ile Glu Pro Ala Leu
125

Asn Gly Ala Gly Gly Gly
140

Asn Tyr Leu Leu Pro Lys
155 160
Asn Met Glu Gln Tyr Leu

175

Arg Leu Val Ala Gln His

190

Asn Lys Glu Asn Gln Asn
205

Lys Thr Ser Ala Arg Tyr
220

Gly Ile Thr Ser Glu Lys
235 240
Ile Ser Phe Asn Ala Ala

255 

Leu Asp Asn Ala Gly Lys 

270 
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lie 


Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp

355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala

370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln

450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

515 520 525

Arg Leu Ala Arg Gly His Ser Leu

530 535

<210> 35 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 132 GCC 



<400> 35 



Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Ala Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620

<210> 36
<211> 536
<212> PRT
<213> Artificial Sequence
<220>
<223> Mutant rep protein: rep68 132 GCC
<400> 36

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Ala Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 37
<211> 621
<212> PRT
<213> Artificial Sequence
<220>
<223> Mutant rep protein: rep78 140 GCC
<400> 37

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Ala Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620

<210> 38
<211> 536
<212> PRT
<213> Artificial Sequence
<220>
<223> Mutant rep protein: rep68 140 GCC
<400> 38

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Ala Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 39
<211> 621
<212> PRT
<213> Artificial Sequence
<220>
<223> Mutant rep protein: rep78 161 GCC
<400> 39

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Ala Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620

<210> 40
<211> 536
<212> PRT
<213> Artificial Sequence
<220>
<223> Mutant rep protein: rep68 161 GCC
<400> 40

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Ala Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 41
<211> 621
<212> PRT
<213> Artificial Sequence
<220>
<223> Mutant rep protein: rep78 163 GCT
<400> 41

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Ala Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620

<210> 42 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 163 GCT 



<400> 42

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Ala Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

WO 03/018820 



PCT/IB02/04087 






Til IT 


TVrr 


Glv 


Lvs 








340 


Phe 


Tvr 


Glv 


Cvs 






355 




Cys 


Val 


Asp 


Lys 




370 






Lys 


Val 


Val 


Glu 


385 








Val 


Asp 


Gin 


Lys 


lie 


Val 


Thr 


Ser 








420 


Thr 


Thr 


Phe 


Glu 






435 




Glu 


Leu 


Thr 


Arg 




450 






Glu 


Val 


Lys 


Asp 


465 








Glu 


His 


Glu 


Phe 


Pro 


Ser 


Asp 


Ala 








500 


Ala 


Gin 


Pro 


Ser 






515 




Arg 


Leu 


Ala 


Arg 



530 



Thr 


Asn 


He 


Ala 


Val 


Asn 


Trp 


Thr 








360 


Met 


Val 


He 


Trp 






375 




Ser 


Ala 


Lys 


Ala 




390 






Cys 


Lys 


Ser 


Ser 


405 








Asn 


Thr 


Asn 


Met 


His 


Gin 


Gin 


Pro 








440 


Arg 


Leu 


Asp 


His 






455 




Phe 


Phe 


Arg 


Trp 




470 






Tyr 


Val 


Lys 


Lys 


485 








Asp 


He 


Ser 


Glu 


Thr 


Ser 


Asp 


Ala 








520 


Gly His 


Ser 


Leu 






535 





Glu Ala He Ala 
345 

Asn Glu Asn Phe 

Trp Glu Glu Gly 

380 

He Leu Gly Gly 
395 

Ala Gin He Asp 
410 

Cys Ala Val He 
425 

Leu Gin Asp Arg 

Asp Phe Gly Lys 

460 

Ala Lys Asp His 
475 

Gly Gly Ala Lys 
490 

Pro Lys Arg Val 
505 

Glu Ala Ser He 



His Thr Val Pro 
350 

Pro Phe Asn Asp 
365 

Lys Met Thr Ala 

Ser Lys Val Arg 

400 

Pro Thr Pro Val 
415 

Asp Gly Asn Ser 
430 

Met Phe Lys Phe 
445 

Val Thr Lys Gin 

Val Val Glu Val 

480 

Lys Arg Pro Ala 
495 

Arg Glu Ser Val 
510 

Asn Tyr Ala Asp 
525 



<210> 43 
<211> 621 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Mutant rep protein: rep78 175 GCT 



<400> 43 



Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Ala Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620

<210> 44
<211> 536
<212> PRT
<213> Artificial Sequence

<220>
<223> Mutant rep protein: rep68 175 GCT

<400> 44

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Ala Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 45 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 193 GCG

<400> 45

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Ala Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620



<210> 46 
<211> 536 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Mutant rep protein: rep68 193 GCG 



<400> 46 












Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Ala Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535



















<210> 47 

<211> 621 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 196 GCC

<400> 47

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Ala Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620

<210> 48
<211> 536
<212> PRT
<213> Artificial Sequence

<220>
<223> Mutant rep protein: rep68 196 GCC



<400> 48 



Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Ala Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 49
<211> 621
<212> PRT
<213> Artificial Sequence

<220>
<223> Mutant rep protein: rep78 197 GCC

<400> 49

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ala Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620

<210> 50 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 



<400> 50 



Thr Ala Gly Phe 


Tyr 
5 


Glu 


lie 


val 


Glu 


His 


Leu Pro 


Gly 


He 


Ser 


Asp 






20 










Lys 


Glu 


Trp Glu 


Leu 


Pro 


Pro 


Asp 




35 








40 


Glu 


Gin 


Ala Pro 


Leu 


Thr 


Val 


A J. a. 




50 








55 




Thr Glu Trp Arg 


Arg 


Val 


Ser 


Lys 


65 








70 






Gin 


Phe 


Glu Lys 


Gly 


Glu 


Ser 


Tyr 






85 








Thr Thr Gly Val 


Lys 


Ser 


Met 


val 






100 










Arg 


Glu 


Lys Leu 


He 


G-Ln 


Arg 


lie 




115 








120 


Pro 


Asn 


Trp Phe 


Ala 


Val 


Thr 


Lys 




130 






mm a mm* 

135 




Asn 


Lys 


Val Val 


Asp 


Glu 


Cys 


Tyr 


145 






150 




Ala 


Thr 


Gin 


Pro Glu 


Leu 


Gin 


Trp 








165 








Ser 


Ala 


Cys Leu 


Asn 


Leu 


Thr 


Glu 






180 










Leu 


Thr 


His Val 


Ala 


Gin 


Thr 


Gin 






195 








200 


Pro 


Asn 


Ser Asp 


Ala 


Pro 


Val 


He 




210 








215 


Val 


Met 


Glu 


Leu Val 


Gly 


Trp 


Leu 


225 








230 






Gin 


Trp 


He Gin 


Glu 


Asp 


Gin 


Ala 






245 








Ser 


Asn 


Ser Arg 


Ser 


Gin 


He 


Lys 






260 










He 


Met 


Ser Leu 


Thr 


Lys 


Thr 


Ala 






275 








280 


Pro 


Val 


Glu Asp 


He 


Ser 


Ser 


Asn 




290 






295 




Asn Gly Tyr Asp 


Pro 


Gin 


Tyr 


Ala 


305 








310 






Thr 


Lys 


Lys Phe 


Gly 


Lys 


Arg 


Asn 




325 








Thr Thr Gly Lys 


Thr 


Asn 


He 


Ala 






340 










Phe Tyr Gly Cys 


Val 


Asn 


Trp 


Thr 






355 








360 


Cys 


Val Asp Lys 


Met 


Val 


He 


Trp 


370 








375 




Lys 


Val 


Val Glu 


Ser 


Ala 


Lys 


Ala 


385 








390 







620 



197 GCC 



He 


Lys 
10 


Val 


Pro 


ser 


ASp 


iicU 

1 c 
15 


3i en 
ASp 


Ser 


Phe 


Val 


TV «% 

Asn 


i rp 


val 




ri~\ lt 


25 










30 




He 


Ser 


Asp 


Met 


Asp 


Leu 


Asn 


Leu 








45 








Glu 


Lys 


Leu 


Gin 


Arg 


Asp 


Phe 


Leu 






60 










Ala 


Pro 


Glu 
75 


Ala 


Lieu 


"DVici 

irne 


XT lie: 


v ax 

OA 
O U 


Phe 


His 
90 


Met 


HIS 


TT 1 

vai 


i-ieu 


Val 


ril 


Leu Gly 


Arg 


Phe 


Leu 


Ser 


Gin 


He 


105 










110 






Tyr Arg 


Gly 


He 


Glu 


Pro 


Thr 


Leu 










125 








Thr 


Arg 


Asn 


Gly 


Aia 






















He 


Pro 


Asn 


Tyr 


Leu 


Leu 


Pro 


Lys 






155 








160 


Trp 


Thr 


Asn 


Met 


Glu 


Gin 


Tyr 


Leu 


170 










175 


His 


Arg 


Lys 


Arg 


Leu 


Val 


Ala 


Gin 


185 








190 






Glu 


Gin 


Asn 


Lys 


Glu 


Asn 


Gin 


Asn 








205 








Arg 


Ser 


Lys 


Thr 


Ser 


Ala 


Arg 


Tyr 






220 










Asp 


Lys 


Gly 


He 


Thr 


Ser 


Glu 


Lys 


235 










240 


Ser Tyr 


He 


Ser 


Phe 


Asn 


Ala 


Ala 




250 










255 




Ala 


Ala 


Leu 


Asp 


Asn 


Ala 


Gly 


Lys 


265 










270 






Pro Asp 


Tyr 


Leu 


Val 


Gly 


Gin 


Gin 








285 








Arg 


He 


Tyr 


Lys 
300 


He 


Leu 


Glu 


Leu 


Ala 


Ser 


Val 
315 


Phe 


Leu 


Gly 


Trp 


Ala 
320 


Thr 


He 


Trp 


Leu 


Phe 


Gly 


Pro 


Ala 




330 








335 




Glu 


Ala 


He 


Ala 


His 


Thr 


Val 


Pro 


345 










350 






Asn 


Glu 


Asn 


Phe 


Pro 
365 


Phe 


Asn 


Asp 


Trp 


Glu 


Glu 


Gly 
380 


Lys 


Met 


Thr 


Ala 


He 


Leu 


Gly 


Gly 


Ser 


Lys 


Val 


Arg 






395 








400 
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Val 


Asp 


Gin 


Lys Cys Lys 


Ser 








405 




He 


Val 


Thr 


Ser Asn Thr 


Asn 








420 




Thr 


Thr 


Phe 


Glu His Gin 


Gin 






435 






Glu 


Leu 


Thr 


Arg Arg Leu Asp 




450 






455 


Glu 


Val 


Lys 


Asp Phe Phe 


Arg 


465 




470 




Glu 


His 


Glu 


Phe Tyr Val 


Lys 








485 




Pro 


Ser 


Asp 


Ala Asp He 


Ser 








500 




Ala 


Gin 


Pro 


Ser Thr Ser 


Asp 






515 






Arg 


Leu 


Ala 


Arg Gly His 


Ser 


530 






535 



410 415 
Met Cys Ala Val He Asp Gly Asn Ser 

425 430 
Pro Leu Gin Asp Arg Met Phe Lys Phe 
440 445 
His Asp Phe Gly Lys Val Thr Lys Gin 

460 

Trp Ala Lys Asp His Val Val Glu Val 

475 480 
Lys Gly Gly Ala Lys Lys Arg Pro Ala 

490 495 
Glu Pro Lys Arg Val Arg Glu Ser Val 

505 510 
Ala Glu Ala Ser He Asn Tyr Ala Asp 
520 525 



<210> 51 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 221 GCA 



<400> 51

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp

1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu

20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile

35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu

50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val

65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu

85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile

100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu

115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly

130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys

145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu

165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His

180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn

195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ala Ala Arg Tyr

210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys

225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala

305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp

355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala

370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg

385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln

450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val

465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu

530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys

545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu

565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr

580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp

595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln

610 615 620

<210> 52 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 221 GCA 
<400> 52 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp

1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu

20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile

35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu

50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val

65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu

85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile

100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu

115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly

130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys

145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu

165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His

180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn

195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ala Ala Arg Tyr

210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys

225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala

305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp

355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala

370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg

385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln

450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val

465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

515 520 525

Arg Leu Ala Arg Gly His Ser Leu

530 535

<210> 53 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 228 GCG 



<400> 53

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp

1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu

20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile

35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu

50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val

65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu

85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile

100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu

115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly

130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys

145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu

165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His

180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn

195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr

210 215 220

Met Glu Leu Ala Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys

225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala

305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp

355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala

370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg

385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln

450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val

465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu

530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys

545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu

565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr

580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp

595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln

610 615 620

<210> 54
<211> 397
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep52 228 GCG

<400> 54

Met Glu Leu Ala Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys

1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala

85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp

130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala

145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg

165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln

225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val

245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu

305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys

325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu

340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr

355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp

370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln

385 390 395

<210> 55
<211> 536
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep68 228 GCG

<400> 55

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp

1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu

20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile

35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu

50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val

65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu

85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile

100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu

115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly

130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys

145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu

165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His

180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn

195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr

210 215 220

Met Glu Leu Ala Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys

225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala

305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp

355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala

370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg

385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln

450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val

465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

515 520 525

Arg Leu Ala Arg Gly His Ser Leu

530 535















<210> 56 
<211> 312 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 228 GCG 
<400> 56 

Met Glu Leu Ala Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys

1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala

85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp

130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala

145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg

165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln

225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val

245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

290 295 300

Arg Leu Ala Arg Gly His Ser Leu

305 310

<210> 57
<211> 621
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep78 231 GCC



<400> 57 



Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp

1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu

20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile

35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu

50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val

65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu

85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile

100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu

115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly

130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys

145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu

165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His

180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn

195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr

210 215 220

Met Glu Leu Val Gly Trp Ala Val Asp Lys Gly Ile Thr Ser Glu Lys

225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala

305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp

355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala

370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg

385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln

450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val

465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu

530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys

545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu

565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr

580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp

595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln

610 615 620



WO 03/018820 



PCT/IB02/04087 



<210> 58
<211> 397
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep52 231 GCC
<400> 58

Met Glu Leu Val Gly Trp Ala Val Asp Lys Gly Ile Thr Ser Glu Lys

1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala

85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp

130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg

165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val

245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys

325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu

340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr

355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp

370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
385 390 395



<210> 59
<211> 536
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 231 GCC 
<400> 59 



Thr 


Ala 


Gly 


Phe 


Tyr 


Glu 


He 


Val 


He 


Lys 


Val 


Pro 


Ser Asp 


Leu Asp 


1 






5 










10 










15 




Glu 


His 


Leu 


Pro 


Gly 


He 


Ser 


Asp 


Ser 


Phe 


Val 


Asn 


Trp 


Val 


Ala 


Glu 






20 








25 










30 




He 


Lys 


Glu 


Trp 


Glu 


Leu 


Pro 


Pro 


Asp 


Ser 


Asp 


Met 


Asp 


Leu 


Asn 


Leu 




35 










40 










45 








Glu 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 


Ala 


Glu 


Lys 


Leu 


Gin 


Arg 


Asp 


Phe 


Leu 


50 










55 










60 








Val 


Thr 


Glu 


Trp 


Arg 


Arg 


Val 


Ser 


Lys 


Ala 


Pro 


Glu 


Ala 


Leu 


Phe 


Phe 


65 






70 










75 








Val 


80 


Gin 


Phe 


Glu 


Lys 


Gly 


Glu 


Ser 


Tyr 


Phe 


His 


Met 


His 


Val 


Leu 


Glu 








85 










90 










95 


He 


Thr 


Thr 


Gly 


Val 


Lys 


Ser 


Met 


Val 


Leu 


Gly 


Arg 


Phe 


Leu 


Ser 


Gin 






100 








105 










110 






Arg 


Glu 


Lys 


Leu 


He 


Gin 


Arg 


He 


Tyr 


Arg 


Gly 


He 


Glu 


Pro 


Thr 


Leu 




115 










120 










125 








Pro 


Asn 


Trp 


Phe 


Ala 


Val 


Thr 


Lys 


Thr 


Arg 


Asn Gly 


Ala Gly Gly Gly 




130 








i o e 

135 










140 










Asn 


Lys 


Val 


Val 


Asp 


Glu 


Cys 


Tyr 


He 


Pro 


Asn 


Tyr 


Leu 


Leu 


Pro 


Lys 


145 






150 










155 




Glu 


Gin 




160 


Thr 


Gin 


Pro 


Glu 


Leu 


Gin 


Trp 


Ala 


Trp 


Thr 


Asn 


Met 


Tyr 


Leu 










165 








J. / U 










175 


His 


Ser 


Ala 


Cys 


Leu 


Asn 


Leu 


Thr 


Glu Arg 


Lys 


Arg 


Leu 


Val 


Ala 


Gin 






180 










185 










190 


Gin 




Leu 


Thr 


His 


Val 


Ser 


Gin 


Thr 


Gin 


Glu 


Gin 


Asn 


Lys 


Glu 


Asn 


Asn 




195 










200 










205 


Ala 




Tyr 


Pro 


Asn 


Ser 


Asp 


Ala 


Pro 


Val 


He 


Arg 


Ser 


Lys 


Thr 


Ser 


Arg 




210 








215 










220 






Glu 


Lys 


Met 


Glu 


Leu 


Val 


Gly 


Trp 


Ala 


Val 


Asp 


Lys 


Gly 


He 


Thr 


Ser 


225 








230 










235 










240 


Gin 


Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ala 


Ser 


Tyr 


He 


Ser 


Phe 


Asn 


Ala 


Ala 








245 








250 










255 




Ser 


Asn 


Ser 


Arg 


Ser 


Gin 


He 


Lys 


Ala 


Ala 


Leu 


Asp 


Asn Ala Gly Lys 








260 










265 










270 






He 


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 


Pro 


Asp 


Tyr 


Leu 


Val Gly Gin Gin 






275 








280 










285 




Glu 


Leu 


Pro 


Val 


Glu Asp 


He 


Ser 


Ser 


Asn Arg 


He 


Tyr 


Lys 


He 


Leu 




290 










295 










300 








Ala 


Asn 


Gly 


Tyr 


Asp 


Pro 


Gin 


Tyr 


Ala 


Ala 


Ser 


Val 


Phe 


Leu Gly Trp 


305 




310 










315 










320 


Thr 


Lys 


Lys 


Phe 


Gly 


Lys 


Arg 


Asn 


Thr 


He 


Trp 


Leu 


Phe 


Gly Pro 


Ala 






325 










330 














Thr 


Thr 


Gly Lys 


Thr 


Asn 


He 


Ala 


Glu 


Ala 


He 


Ala 


His 


Thr 


Val 


Pro 








340 










345 










350 






Phe 


Tyr 


Gly Cys 


Val 


Asn 


Trp 


Thr 


Asn 


Glu 


Asn 


Phe 


Pro 


Phe Asn Asp 




355 










360 










365 




Thr 


Ala 


Cys 


Val 


Asp 


Lys 


Met 


Val 


He 


Trp 


Trp 


Glu 


Glu Gly 


Lys 


Met 


370 






375 










380 










Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 


He 


Leu 


Gly Gly 


Ser Lys Val Arg 


385 










390 








395 






Thr 




400 


Val 


Asp 


Gin 


Lys 


Cys 


Lys 


Ser 


Ser 


Ala 


Gin 


He 


Asp 


Pro 


Pro 


Val 






405 










410 










415 


Ser 


He 


Val 


Thr 


Ser 


Asn 


Thr 


Asn 


Met 


Cys 


Ala 


Val 


He 


Asp 


Gly Asn 
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420 






425 


Thr 


Thr 


Phe 


Glu His 


Gin Gin 


Pro 


Leu 






435 






440 




Glu 


Leu 


Thr 


Arg Arg 


Leu Asp 


His 


Asp 




450 




455 




Ala 


Glu 


Val 


Lys 


Asp Phe 


Phe Arg 


Trp 


465 






470 






Glu 


His 


Glu 


Phe Tyr 


Val Lys 


Lys Gly 








485 








Pro 


Ser 


Asp 


Ala Asp 


He Ser 


Glu 


Pro 






500 






505 


Ala 


Gin 


Pro 


Ser Thr 


Ser Asp 


Ala 


Glu 






515 






520 




Arg 


Leu 


Ala 


Arg Gly His Ser 


Leu 




530 






535 







<210> 60 
<211> 312 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 231 



<400> 60 



Met 


Glu 


Leu 


Val 


Gly 
5 


Trp 


Ala 


Val 


Asp 


1 
Gin 


Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ala 


Ser 






20 










25 


Ser 


Asn 


Ser 


Arg 


Ser 


Gin 


lie 


Lys 


Ala 






35 








40 




He 


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 


Pro 




50 










55 






Pro 


Val 


Glu 


Asp 


He 


Ser 


Ser 


Asn Arg 


65 








70 








Asn 


Gly 


Tyr 


Asp 


Pro 
85 


Gin 


Tyr 


Ala 


Ala 


Thr 


Lys 


Lys 


Phe 


Gly 


Lys 


Arg 


Asn 


Thr 




1-00 










105 


Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 


Glu 






115 








120 




Phe 


Tyr 


Gly 


Cys 


Val 


Asn 


Trp 


Thr 


Asn 




130 










135 






Cys 


Val 


Asp 


Lys 


Met 


Val 


He 


Trp 


Trp 


145 










150 








Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 


He 








165 










Val 


Asp 


Gin 


Lys 


Cys 


Lys 


Ser 


Ser 


Ala 






180 










185 


He 


Val 


Thr 


Ser 


Asn 


Thr 


Asn 


Met 


Cys 






195 










200 




Thr 


Thr 


Phe 


Glu 


His 


Gin 


Gin 


Pro 


Leu 




210 










215 






Glu 


Leu 


Thr 


Arg 


Arg 


Leu 


Asp 


His 


Asp 


225 










230 








Glu 


Val 


Lys 


Asp 


Phe 


Phe 


Arg 


Trp Ala 










245 










Glu 


His 


Glu 


Phe 


Tyr 


Val 


Lys 


Lys 


Gly 








260 








265 


Pro 


Ser 


Asp 


Ala 


Asp 


He 


Ser 


Glu 


Pro 






275 










280 
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430 






Gin Asp Arg 


Met 
445 


Phe 


Lys 


Phe 


Phe Gly Lys 


Val 


Thr 


Lys 


Gin 


460 










Lys Asp His 


Val 


Val 


Glu 


Val 


475 








480 


Gly Ala Lys 


Lys 


Arg 


Pro 


Ala 


490 






495 




Lys Arg Val 


Arg 


Glu 
510 


Ser 


Val 


Ala Ser He 


Asn 
525 


Tyr 


Ala 


Asp 



GCC 



Lys Gly He 


Thr 


Ser 


Glu 


Lys 


10 






15 




Tyr He Ser 


Phe 


Asn 


Ala 


Ala 




30 






Ala Leu Asp 


Asn 


Ala 


Gly 


Lys 




45 








Asp Tyr Leu 


Val 


Gly 


Gin 


Gin 


60 










He Tyr Lys 


He 


Leu 


Glu 


Leu 


75 








80 


Ser Val Phe 


Leu Gly 


Trp 


Ala 


90 






95 




He Trp Leu 


Phe 


Gly 


Pro 


Ala 




110 






Ala He Ala 


His 


Thr 


Val 


Pro 




125 








Glu Asn Phe 


Pro 


Phe 


Asn 


Asp 


140 










Glu Glu Gly 


Lys 


Met 


Thr 


Ala 


155 






160 


Leu Gly Gly 


Ser 


Lys 


Val 


Arg 


170 






175 


Val 


Gin He Asp 


Pro 


Thr 


Pro 




190 






Ala Val He 


Asp 


Gly 


Asn 


Ser 




205 








Gin Asp Arg 


Met 


Phe 


Lys 


Phe 


220 










Phe Gly Lys 


Val 


Thr 


Lys 


Gin 


235 








240 


Lys Asp His 


Val 


Val 


Glu 


Val 


250 






255 




Gly Ala Lys 


Lys 


Arg 


Pro 


Ala 






270 






Lys Arg Val 


Arg 


Glu 


Ser 


Val 




285
Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

290 295 300

Arg Leu Ala Arg Gly His Ser Leu
305 310

<210> 61 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 234 GCG 

<400> 61

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp

1 5 10 15 

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu

20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile

35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu

50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu

85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile

100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu

115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly

130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu

165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His

180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn

195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr

210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Ala Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp

355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
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370 








. 375 






-X— 1 jr O 


Val 


Val 


Glu 


Ser 


Ala Lys 


Ala 


lie 


385 

J O mj 










390 






V ci-L 


TV c*T"\ 


Gin Lys 




Lvs Ser 


Ser 


Ala 








405 








11c 


Vdl 


Thr 


Ser 




Thr Asn 


Met 


Cys 








420 








425 


lnr 


inr 


Phe 


Glu 


His 


Gin Gin 


Pro 


Leu 






435 








440 




(jr±U 


Lieu 


Thr Arg 


TV -f~/-r 


Tif»11 Ae*T"» 

JJCU 


His 


Asp 




450 








455 






Glu 


Val 


Lys 


Asp 


Phe 


Phe Arg 


Trp Ala 


465 








470 






Glu 


His 


Glu 


Phe 


Tyr 


Val Lys 


Lys 


Gly 










485 








Pro 


Ser 


Asp 


Ala 


Asp 


lie Ser 


Glu 


Pro 






500 








505 


Ala 


Gin 


Pro 


Ser 


Thr 


Ser Asp 


Ala 


Glu 






515 








520 




Arg 


Tyr 


Gin 


Asn 


Lys 


Cys Ser 


Arg 


His 


530 








535 






Phe 


Pro 


Cys 


Arg 


Gin 


Cys Glu 


Arg Met 


545 










550 






Phe 


Thr 


His 


Gly 


Gin 


Lys Asp 


Cys 


Leu 










565 








Ser 


Gin 


Pro 


Val 


Ser 


Val Val 


Lys 


Lys 








580 








585 


lie 


His 


His 


lie 


Met 


Gly Lys 


Val 


Pro 






595 








600 




Leu 


Val 


Asn 


Val 


Asp 


Leu Asp 


Asp 


Cys 




610 








615 







<210> 62 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 234



<400> 62 



Met 


Glu 


Leu 


Val 


Gly 
5 


Trp 


Leu 


Val 


Asp 


1 
Gin 


Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ala 


Ser 






20 










25 


Ser 


Asn 


Ser 


Arg 


Ser 


Gin 


He 


Lys 


Ala 






35 










40 




lie 


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 


Pro 




50 










55 






Pro 


Val 


Glu 


Asp 


He 


Ser 


Ser 


Asn Arg 


65 










70 








Asn 


Gly 


Tyr 


Asp 


Pro 


Gin 


Tyr 


Ala 


Ala 










85 










Thr 


Lys 


Lys 


Phe 


Gly 


Lys 


Arg 


Asn 


Thr 






100 










105 


Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 


Glu 






115 








120 




Phe 


Tyr 


Gly 


Cys 


Val 


Asn 


Trp 


Thr 


Asn 




13 0 










135 






Cys 


Val 


Asp 


Lys 


Met 


Val 


He 


Trp 


Trp 



145 150 
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380 



Leu 


Gly Gly 


Ser 


Lys 


Val 


Arg 




395 








400 


Gin 


He Asp 


Pro 


Thr 


Pro 


Val 


410 








415 




Ala 


Val He 


Asp 


Gly 


Asn 


Ser 








430 






Gin 


Asp Arg 


Met 


Phe 


Lys 


Phe 






445 








Phe 


Gly Lys 


Val 


Thr 


Lys 


Gin 




460 










Lys 


Asp His 


Val 


Val 


Glu 


Val 


475 








480 


Gly 


Ala Lys 


Lys 


Arg 


Pro 


Ala 


490 








495 




Lys 


Arg Val 


Arg 


Glu 


Ser 


Val 






510 






Ala 


Ser He 


Asn 


Tyr 


Ala 


Asp 






525 








Val 


Gly Met 


Asn 


Leu 


Met 


Leu 




540 










Asn 


Gin Asn 


Ser 


Asn 


He 


Cys 




555 








560 


Glu 


Cys Phe 


Pro 


Val 


Ser 


Glu 


570 






575 




Ala 


Tyr Gin 


Lys 


Leu 


Cys 


Tyr 






590 






Asp 


Ala Cys 


Thr 


Ala 


Cys 


Asp 






605 








He 


Phe Glu 


Gin 










620 










GCG 












Ala 


Gly He 


Thr 


Ser 


Glu 


Lys 


10 






15 




Tyr 


He Ser 


Phe 


Asn 


Ala 


Ala 






30 






Ala 


Leu Asp 


Asn 


Ala 


Gly 


Lys 






45 








Asp 


Tyr Leu 


Val 


Gly 


Gin 


Gin 




60 










He 


Tyr Lys 


He 


Leu 


Glu 


Leu 




75 








80 


Ser 


Val Phe 


Leu 


Gly 


Trp 


Ala 


90 








95 




He 


Trp Leu 


Phe 


Gly 


Pro 


Ala 






110 






Ala 


He Ala 


His 


Thr 


Val 


Pro 






125 








Glu 


Asn Phe 


Pro 


Phe 


Asn 


Asp 




140 










Glu 


Glu Gly 


Lys 


Met 


Thr 


Ala 




155 








160 
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Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 








165 








Val 


Asp 


Gin 


Lys 


Cys 


Lys 


Ser 


Ser 






180 










lie 


Val 


Thr 


Ser 


Asn 


Thr 


Asn 


Met 






195 










200 


Thr 


Thr 


Phe 


Glu 


His 


Gin 


Gin 


Pro 




210 










215 


His 


Glu 


Leu 


Thr Arg 


Arg 


Leu Asp 


225 










230 






Glu 


Val 


Lys 


Asp 


Phe 


Phe 


Arg 


Trp 








245 








Glu 


His 


Glu 


Phe 


Tyr Val Lys Lys 








260 










Pro 


Ser 


Asp 


Ala 


Asp 


lie Ser Glu 






275 










280 


Ala 


Gin 


Pro 


Ser 


Thr 


Ser Asp 


Ala 




290 










295 




Arg 


Tyr 


Gin 


Asn 


Lys 


Cys 


Ser 


Arg 


305 








310 






Phe 


Pro 


Cys 


Arg 


Gin 


Cys 


Glu Arg 








325 








Phe 


Thr 


His 


Gly 


Gin 


Lys 


Asp 


Cys 








340 










Ser 


Gin 


Pro 


Val 


Ser 


Val 


Val 


Lys 






355 










360 


lie 


His 


His 


lie 


Met 


Gly Lys 


Val 




370 










375 




Leu 


Val 


Asn 


Val 


Asp 


Leu Asp 


Asp 


385 










390 






<210> 63 












<211> 536 












<212> PRT 












<213> Artificial Sequence 




<220> 














<223> Mutant rep protein: rep68 


<400> 63 










Val 


Thr 


Ala 


Gly 


Phe 


Tyr 
5 


Glu 


He 


1 
Glu 


His 


Leu 


Pro 


Gly 


He 


Ser 


Asp 








20 










Lys 


Glu 


Trp 


Glu 


Leu 


Pro 


Pro 


Asp 




35 










40 


Glu 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 


Ala 




50 










55 




Thr 


Glu 


Trp 


Arg 


Arg 


Val 


Ser 


Lys 


65 










70 






Gin 


Phe 


Glu 


Lys 


Gly 


Glu 


Ser 


Tyr 










85 








Thr 


Thr 


Gly 


Val 


Lys 


Ser 


Met 


Val 






100 










Arg 


Glu 


Lys 


Leu 


lie 


Gin 


Arg 


He 




115 










120 


Pro 


Asn 


Trp 


Phe 


Ala 


Val 


Thr 


Lys 




130 








135 




Asn 


Lys 


Val 


Val 


Asp 


Glu 


Cys 


Tyr 


145 








150 




Ala 


Thr 


Gin 


Pro 


Glu 


Leu 


Gin 


Trp 



170 175
Ala Gln Ile Asp Pro Thr Pro Val
185 190
Cys Ala Val Ile Asp Gly Asn Ser

205

Leu Gln Asp Arg Met Phe Lys Phe

220

Asp Phe Gly Lys Val Thr Lys Gln
235 240
Ala Lys Asp His Val Val Glu Val

250 255
Gly Gly Ala Lys Lys Arg Pro Ala
265 270
Pro Lys Arg Val Arg Glu Ser Val

285

Glu Ala Ser Ile Asn Tyr Ala Asp

300

His Val Gly Met Asn Leu Met Leu
315 320
Met Asn Gln Asn Ser Asn Ile Cys

330 335
Leu Glu Cys Phe Pro Val Ser Glu
345 350

Lys Ala Tyr Gln Lys Leu Cys Tyr

365

Pro Asp Ala Cys Thr Ala Cys Asp

380

Cys Ile Phe Glu Gln
395



10 15
Ser Phe Val Asn Trp Val Ala Glu
25 30
Ser Asp Met Asp Leu Asn Leu Ile

45

Glu Lys Leu Gln Arg Asp Phe Leu

60

Ala Pro Glu Ala Leu Phe Phe Val

75 80
Phe His Met His Val Leu Val Glu

90 95
Leu Gly Arg Phe Leu Ser Gln Ile
105 110
Tyr Arg Gly Ile Glu Pro Thr Leu

125

Thr Arg Asn Gly Ala Gly Gly Gly

140

Ile Pro Asn Tyr Leu Leu Pro Lys
155 160
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His

180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn

195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr

210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Ala Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp

355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala

370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln

450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 64
<211> 312
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep40 234 GCG

<400> 64

Met Glu Leu Val Gly Trp Leu Val Asp Ala Gly Ile Thr Ser Glu Lys

1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

20 25 30
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uCI 




OCX y 


Ser 

w ^ 


Gin 


He Lys Ala Ala Leu Asp Asn Ala 


Gly Lys 












40 


45 






Gin 


Tl <=>• 
lie 


l v lc I— 


ocx lieu 


TYiTT 

X li I Li 




Thr Ala Pro Asp Tyr 


Leu Val 


Gly 


Gin 




en 
-j \j 






55 


60 




Glu 




FL. O 


vdJ. 


Glu Asp 


He 


*J w X 


Ser Asn Arg He Tyr 


Lys He 


Leu 


Leu 


DO 






70 


75 








80 


Asn 


Giy 


Tyr Asp 


Pro 


m n 

uXil 


Tyr Ala Ala Ser Val 


Phe Leu Gly 


Trp 


Ala 




85 




90 






95 


Ala 


Thr 


Lys 


Lys Phe 


Gly 


iiys 


Arg Asn Thr He Trp 


Leu Phe Gly 


Pro 




100 




105 




110 






Thr 


Thr 


Gly Lys 


Thr 


TV 

ash 


He Ala Glu Ala He 


Ala His 


Thr 


Val 


Pro 






115 






120 


125 










Tyr 


Gly Cys 


Val 




Trp Thr Asn Glu Asn 


Phe Pro 


Phe 


Asn Asp 




130 






135 


140 






Ala 


Cys 


Val 


Asp Lys 


Met 


Val 


He Trp Trp Glu Glu 


Gly Lys 


Met 


Thr 


145 






150 


155 






Val 


160 


Lys 


Val 


Val Glu 


Ser 


Ala 


Lys Ala He Leu Gly Gly Ser Lys 


Arg 






165 




170 






175 


Val 


Val 


Asp 


Gin Lys 


Cys 


Lys 


Ser Ser Ala Gin He 


Asp Pro 


Thr 


Pro 




180 




185 




190 






He 


Val 


Thr Ser 


Asn 


Thr 


Asn Met Cys Ala Val 


He Asp 


Gly 


Asn 


Ser 






195 






200 


205 






Phe 


Thr 


Thr 


Phe Glu 


His 


Gin 


Gin Pro Leu Gin Asp Arg Met 


Phe 


Lys 




210 








215 


220 






Gin 


Glu 


Leu 


Thr Arg Arg 


Leu 


Asp His Asp Phe Gly Lys Val 


Thr 


Lys 


225 








230 


235 






Glu 


240 


Glu 


Val 


Lys Asp 


Phe 


Phe 


Arg Trp Ala Lys Asp 


His Val 


Val 


Val 






245 




250 






255 


Ala 


Glu 


His 


Glu Phe 


Tyr 


Val 


Lys Lys Gly Gly Ala 


Lys Lys 


Arg 


Pro 






260 




265 




270 




Val 


Pro 


Ser 


Asp Ala 


Asp 


He 


Ser Glu Pro Lys Arg 


Val Arg 


Glu 


Ser 






275 




280 


285 








Ala 


Gin 


Pro Ser 


Thr 


Ser 


Asp Ala Glu Ala Ser 


He Asn Tyr 


Ala 


Asp 




290 








295 


300 








Arg 


Leu 


Ala Arg Gly His 


Ser Leu 











305 310 



<210> 65 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 237 GCC 



<400> 65 



Thr 


Ala 


Gly 


Phe 


Tyr 


Glu 


He 


Val He Lys Val Pro Ser Asp 


Leu 


Asp 


1 






5 






10 


15 


Glu 


Glu 


His 


Leu 


Pro 


Gly 


He 


Ser 


Asp Ser Phe Val Asn Trp Val 


Ala 








20 






25 30 




He 


Lys 


Glu 


Trp 


Glu 


Leu 


Pro 


Pro 


Asp Ser Asp Met Asp Leu Asn 


Leu 




35 










40 45 






Glu 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 


Ala Glu Lys Leu Gin Arg Asp 


Phe 


Leu 




50 










55 


60 




Val 


Thr 


Glu 


Trp 


Arg 


Arg 


Val 


Ser 


Lys Ala Pro Glu Ala Leu Phe 


Phe 


65 






70 




75 




80 


Gin 


Phe 


Glu 


Lys 


Gly 


Glu 


Ser 


Tyr Phe His Met His Val Leu 


Val 


Glu 








85 






90 


95 




Thr 


Thr 


Gly 


Val 


Lys 


Ser 


Met 


Val Leu Gly Arg Phe Leu Ser 


Gin 


He 






100 






105 110 






Arg 


Glu 


Lys 


Leu 


He 


Gin 


Arg 


He Tyr Arg Gly He Glu Pro 


Thr 


Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly

130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu

165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His

180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn

195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr

210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Ala Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp

355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala

370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln

450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu

530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu

565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr

580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605
Leu Val Asn Val Asp Leu Asp Asp
610 615

<210> 66 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 



<400> 66 



ft M _ A_ 

Met 


Glu 


Leu 


val 


r*l i r 

Vaiy 


Trp 


T .01 t 


Val 
val 


l 








c 








Gin 


Trp 


lie 


Gin 


blU 


ASp 


VjlJl 


ax a 






20 










Ser 


Asn 


ser 


Arg 


O a 




JL 1C2 


T .\ra 














40 


lie 


Met 


£>er 


Leu 


inr 


jjys 


J. 111 


t\Jl d 




bU 










—j 




Pro 


vaJ. 


G1U 




Tl A 

lie 


O A 


oci 


Ren 
noil 


bo 










7fl 
/ u 






Asn 


Gly 


Tyr 


Asp 


fro 


bin 


ryr 


Ala 










Q C 
OS 








Thr 


Lys 


Lys 






i»ys 




A O Tl 

ash 








i no 










Txir 


rpV-> 

inr 


CaXy 


iiys 


inr 




11c 


2X1 a 
Alex 




i 


J L 3 










120 


Phe 


Tyr 


Gly 


Cys 


Val 


Asn 


Trp 


Thr 




130 










135 




Cys 


Val 


Asp 


Lys 


Met 


Val 


He 


Trp 


145 










150 






Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 








165 








Val 


Asp 


Gin 


Lys 


Cys 


Lys 


Ser 


Ser 








180 










He 


Val 


Thr 


Ser 


Asn 


Thr 


Asn 


Met 






195 










200 


Thr 


Thr 


Phe 


Glu 


His 


Gin 


Gin 


Pro 




210 










215 




Glu 


Leu 


Thr 


Arg 


Arg 


Leu 


Asp 


His 


225 










230 






Glu 


Val 


Lys 


Asp 


Phe 


Phe 


Arg 


Trp 










245 








Glu 


His 


Glu 


Phe 


Tyr 


Val 


Lys 


Lys 








260 










Pro 


Ser 


Asp 


Ala 


Asp 


He 


Ser 


Glu 






275 










280 


Ala 


Gin 


Pro 


Ser 


Thr 


Ser 


Asp 


Ala 




290 










295 




Arg 


Tyr 


Gin 


Asn 


Lys 


Cys 


Ser 


Arg 


305 










310 






Phe 


Pro 


Cys 


Arg 


Gin 


Cys 


Glu 


Arg 










325 








Phe 


Thr 


His 


Gly 


Gin 


Lys 


Asp 


Cys 








340 










Ser 


Gin 


Pro 


Val 


Ser 


Val 


Val 


Lys 






355 










360 


He 


His 


His 


He 


Met 


Gly 


Lys 


Val 




370 










375 




Leu 


Val 


Asn 


Val 


Asp 


Leu 


Asp 


Asp 



Cys Ile Phe Glu Gln

620 



237 


GCC 










Asp 


Lys 


Gly He Ala 


Ser 


Glu 






10 






15 




Ser Tyr 


He Ser Phe 


Asn 


Ala 


Ala 


25 






30 






Ala 


Ala 


Leu Asp Asn Ala Gly 


TiVS 






45 








Pro Asp 


Tyr Leu Val 


Gly Gin 


f3l n 

V3111 






60 








Arg 


He 


Tyr Lys He 


Leu 


Glu 


UC H 




75 






o u 


Ala 


Ser 


Val Phe Leu Gly 


Trp 






90 






95 




Thr 


He 


Trp Leu Phe 


Gly 


Pro 


Ala 


105 






110 






Glu 


Ala 


He Ala His 


Thr 


Val 


Pro 






125 








Asn 


Glu 


Asn Phe Pro 


Phe 


Asn 


Asn 






140 








Trp 


Glu 


Glu Gly Lys 


Met 


Thr 


Ala 




155 






160 


He 


Leu 


Gly Gly Ser 


Lys 


Val 


Arg 




170 






175 




Ala 


Gin 


He Asp Pro 


Thr 


Pro 


Val 


185 






190 






Cys 


Ala 


Val He Asp Gly 


Asn 


Ser 




205 








Leu 


Glri 


Asp Arg Met 


Phe 


Lys 


Phe 






220 








Asp 


Phe 


Gly Lys Val 


Thr 


Lys 


Gin 




235 






240 


Ala 


Lys 


Asp His Val 


Val 


Glu 


Val 




250 






255 




Gly Gly 


Ala Lys Lys 


Arg 


Pro 


Ala 


265 






270 






Pro 


Lys 


Arg Val Arg 


Glu 


Ser 


Val 






285 








Glu 


Ala 


Ser He Asn Tyr 


Ala 


Asp 






300 








His 


Val 


Gly Met Asn Leu 


Met 


Leu 






315 






320 


Met 


Asn 


Gin Asn Ser 


Asn 


He 


Cys 




330 






335 




Leu 


Glu 


Cys Phe Pro 


Val 


Ser 


Glu 


345 






350 






Lys 


Ala 


Tyr Gin Lys 


Leu 


Cys 


Tyr 






365 








Pro Asp 


Ala Cys Thr 


Ala 


Cys 


Asp 






380 








Cys 


He 


Phe Glu Gin 









WO 03/018820 



PCT/IB02/04087 



-88- 



385 390 

<210> 67 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 



<400> 67 



Thr 


Ala 


Gly 


Phe 


Tyr 
5 


Glu 


He 


Val 


1 
Glu 


His 


Leu 


Pro 


Gly 


He 


Ser 


Asp 








20 










Lys 


Glu 


Trp 


Glu 


Leu 


Pro 


Pro 


Asp 




35 










40 


Glu 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 


Ala 




50 










55 




Thr 


Glu 


Trp 


Arg 


Arg 


Val 


Ser 


Lys 


65 










70 






Gin 


Phe 


Glu 


Lys 


Gly 


Glu 


Ser 


Tyr 










85 








Thr 


Thr 


Gly 


Val 


Lys 


Ser 


Met 


Val 






100 










Arg 


Glu 


Lys 


Leu 


He 


- , — ■ 

Gin 


Arg 


He 




115 








120 


Pro 


Asn 


Trp 


Phe 


Ala 


Val 


Thr 


Lys 




130 








135 




Asn 


Lys 


Val 


Val 


Asp 


Glu 


Cys 


Tyr 


145 








150 






Thr 


Gin 


Pro 


Glu 


Leu 


Gin 


Trp 


Ala 










165 








Ser 


Ala 


Cys 


Leu 


Asn 


Leu 


Thr 


Glu 








180 










Leu 


Thr 


His 


Val 


Ser 


Gin 


Thr 


Gin 






195 










200 


Pro 


Asn 


Ser 


Asp 


Ala 


Pro 


Val 


He 




210 








215 




Met 


Glu 


Leu 


Val 


Gly 


Trp 


Leu 


Val 


225 










230 






Gin 


Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ala 








245 








Ser 


Asn 


Ser 


Arg 


Ser 


Gin 


lie 


Lys 








260 










lie 


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 






275 










280 


Pro 


Val 


Glu 


Asp 


He 


Ser 


Ser 


Asn 




290 








295 




Asn 


Gly 


Tyr 


Asp 


Pro 


Gin 


Tyr 


Ala 


305 










310 






Thr 


Lys 


Lys 


Phe 


Gly 


Lys 


Arg 


Asn 








325 








Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 








340 










Phe 


Tyr 


Gly 


Cys 


Val 


Asn 


Trp 


Thr 




355 










360 


Cys 


Val 


Asp 


Lys 


Met 


Val 


He 


Trp 


370 










375 




Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 


385 










390 







395 



237 


GCC 












He 


Lys 


Val Pro 


Ser Asp 


Leu Asp 




10 








15 




Ser 


Phe 


Val Asn 


Trp 


Val 


Ala 


Glu 


25 








30 






Ser 


Asp 


Met Asp 


Leu 


Asn 


Leu 


He 






45 








Glu 


Lys 


Leu Gin Arg Asp 


Phe 


Leu 




60 










Ala 


Pro 


Glu Ala 


Leu 


Phe 


Phe 


Val 






75 








80 


Phe 


His 


Met His 


Val 


Leu 


Val 


Glu 




90 








95 




Leu 


Gly 


Arg Phe 


Leu 


Ser 


Gin 


He 


105 






110 






Tyr Arg 


Gly He 


Glu 


Pro 


Thr 


Leu 








125 








Thr Arg 


Asn Gly Ala Gly 


Gly 


Gly 






140 










He 


Pro 


Asn Tyr 


Leu 


Leu 


Pro 


Lys 






155 








160 


Trp 


Thr 


Asn Met 


Glu 


Gln


Tyr 


Leu 


170 








175 




Arg 


Lys 


Arg Leu 


Val 


Ala 


Gln


His 


185 








190 






Glu 


Gln


Asn Lys 


Glu 


Asn 


Gln


Asn 








205 








Arg 


Ser 


Lys Thr 


Ser 


Ala 


Arg 


Tyr 




220 










Asp 


Lys 


Gly Ile Ala


Ser 


Glu 


Lys 


235 








240 


Ser 


Tyr 


Ile Ser


Phe 


Asn 


Ala 


Ala 




250 








255 




Ala 


Ala 


Leu Asp 


Asn 


Ala 


Gly 


Lys 


265 






270 






Pro 


Asp 


Tyr Leu Val Gly 


Gln


Gln






285 








Arg 


Ile


Tyr Lys


Ile


Leu 


Glu 


Leu 




300 










Ala 


Ser 


Val Phe 


Leu Gly 


Trp 


Ala 






315 








320 


Thr 


Ile


Trp Leu 


Phe 


Gly 


Pro 


Ala 




330 






335 




Glu 


Ala 


Ile Ala


His 


Thr 


Val 


Pro 


345 








350 






Asn 


Glu 


Asn Phe 


Pro 


Phe 


Asn Asp 








365 








Trp 


Glu 


Glu Gly Lys 


Met 


Thr 


Ala 




380 










Ile


Leu 


Gly Gly 


Ser 


Lys 


Val 


Arg 






395 








400 
Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535




<210> 68 
<211> 312 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Mutant rep protein: rep40 237 GCC 



<400> 68

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Ala Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Leu Ala Arg Gly His Ser Leu
305 310

<210> 69
<211> 621
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep78 250 GCC

<400> 69

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Ala Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620

<210> 70 
<211> 397 
<212> PRT 

<213> Artificial Sequence
<220> 

<223> Mutant rep protein: rep52 250 GCC 



<400> 70

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Ala Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
385 390 395

<210> 71 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 250 GCC 



<400> 71

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Ala Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 72 
<211> 312 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 250 GCC 

<400> 72

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Ala Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Leu Ala Arg Gly His Ser Leu
305 310

<210> 73 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 258 GCC 

<400> 73

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Ala Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620

<210> 74
<211> 397
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep52 258 GCC

<400> 74

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Ala Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
385 390 395



<210> 75
<211> 536
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep68 258 GCC

<400> 75

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Ala Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 


He 


385 










390 








Val 


Asp 


Gin 


Lys 


Cys
405 


Lys 


Ser 


Ser 


Ala 


Ile


Val 


Thr 


Ser 
420 


Asn 


Thr 


Asn 


Met 


Cys 
425 


Thr 


Thr 


Phe 
435 


Glu 


His 


Gin 


Gin 


Pro 
440 


Leu 


Glu 


Leu 


Thr Arg Arg 


Leu Asp His Asp 




450 










455 






Glu 


Val 


Lys 


Asp 


Phe 


Phe Arg Trp Ala 


465 










470 








Glu 


His 


Glu 


Phe 


Tyr 
485 


Val 


Lys 


Lys 


Gly 


Pro 


Ser 


Asp Ala Asp 


He 


Ser 


Glu 


Pro 








500 










505 


Ala 


Gin 


Pro 
515 


Ser 


Thr 


Ser 


Asp 


Ala 
520 


Glu 


Arg 


Leu 
530 


Ala 


Arg 


Gly 


His 


Ser 
535 


Leu 





<210> 76 
<211> 312 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 258 



<400> 76 



Met 


Glu 


Leu 


Val 


Gly 
5 


Trp 


Leu 


Val 


Asp 


1 
Gin 


Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ala 


Ser 






20 










25 


Ser 


Ala 


Ser 


Arg 


Ser 


Gin 


He 


Lys 


Ala 






35 








40 




He 


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 


Pro 




50 










55 






Pro 


Val 


Glu 


Asp 


He 


Ser 


Ser Asn Arg 


65 








70 








Asn 


Gly 


Tyr 


Asp 


Pro 


Gin 


Tyr Ala Ala 










85 










Thr 


Lys 


Lys 


Phe 


Gly 


Lys 


Arg Asn 


Thr 








100 










105 


Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 


Glu 






115 










120 




Phe 


Tyr 


Gly 


Cys 


Val 


Asn 


Trp 


Thr 


Asn 




130 










135 






Cys 


Val


Asp 


Lys 


Met 


Val 


He 


Trp 


Trp 


145 










150 








Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 


He 








165 








Val 


Asp 


Gin 


Lys 


Cys 


Lys 


Ser 


Ser 


Ala 








180 










185 


He 


Val 


Thr 


Ser 


Asn 


Thr 


Asn 


Met 


Cys 






195 










200 




Thr 


Thr 


Phe 


Glu 


His 


Gin 


Gin 


Pro 


Leu 




210 










215 






Glu 


Leu 


Thr 


Arg 


Arg 


Leu 


Asp 


His 


Asp 



225 230 
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380 



Leu 


Gly 


Gly 


Ser Lys Val 


Arg 




395 






400 


Gin 


He 


Asp 


Pro Thr Pro 


Val 


410 






415 




Ala 


Val 


He 


Asp Gly Asn 


Ser 








430 




Gin 


Asp 


Arg 


Met Phe Lys 


Phe 








445 




Phe 


Gly 


Lys 


Val Thr Lys 


Gin 






460 






Lys 


Asp 


His 


Val Val Glu 


Val 




475 






480 


Gly 


Ala 


Lys 


Lys Arg Pro 


Ala 


490 






495 




Lys 


Arg


Val 


Arg Glu Ser 


Val 








510 




Ala 


Ser 


He 


Asn Tyr Ala 


Asp 








525 




GCC 










Lys 


Gly 


He 


Thr Ser Glu 


Lys 


10 






15 




Tyr 


He 


Ser 


Phe Asn Ala 


Ala 






30 




Ala 


Leu 


Asp 


Asn Ala Gly 


Lys 








45 




Asp 


Tyr 


Leu 


Val Gly Gin 


Gin 






60 






He 


Tyr 


Lys 


He Leu Glu 


Leu 




75 






80 


Ser 


Val 


Phe 


Leu Gly Trp 


Ala 


90 






95 




He 


Trp 


Leu 


Phe Gly Pro 


Ala 








110 




Ala 


He 


Ala 


His Thr Val 


Pro 








125 




Glu 


Asn 


Phe 


Pro Phe Asn 


Asp 






140 






Glu 


Glu 


Gly 


Lys Met Thr 


Ala 




155 






160 


Leu 


Gly 


Gly 


Ser Lys Val 


Arg 


170 






175 




Gin 


He 


Asp 


Pro Thr Pro 


Val 






190 




Ala 


Val 


He 


Asp Gly Asn 


Ser 








205 




Gin 


Asp 


Arg 


Met Phe Lys 


Phe 






220 






Phe 


Gly 


Lys 


Val Thr Lys 


Gin 




235 






240 
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Glu Val Lys Asp Phe Phe Arg Trp 

245 

Glu His Glu Phe Tyr Val Lys Lys 

260 

Pro Ser Asp Ala Asp Ile Ser Glu
275 280 
Ala Gln Pro Ser Thr Ser Asp Ala

290 295 
Arg Leu Ala Arg Gly His Ser Leu 
305 310 

<210> 77 
<211> 621 
<212> PRT 

<213> Artificial Sequence 



Ala Lys Asp His Val Val Glu Val 

250 255 
Gly Gly Ala Lys Lys Arg Pro Ala 
265 270 
Pro Lys Arg Val Arg Glu Ser Val 

285 

Glu Ala Ser Ile Asn Tyr Ala Asp

300 



<220> 

<223> Mutant rep protein: rep78 260 GCG



<400> 77 



Thr 


Ala 


Gly 


Phe 


1 








Glu 


His 


Leu 


Pro 








20 


Lys 


Glu 


Trp 


Glu 






35 




Glu 


Gin 


Ala 


Pro 




50 






Thr 


Glu 


Trp 


Arg 


65 








Gin 


Phe 


Glu 


Lys 


Thr 


Thr 


Gly 


Val 








100 


Arg 


Glu 


Lys 


Leu 






115 




Pro 


Asn 


Trp 


Phe 




130 






Asn 


Lys 


Val 


Val 


145 








Thr 


Gin 


Pro 


Glu 


Ser 


Ala 


Cys 


Leu 








180 


Leu 


Thr 


His 


Val 






195 




Pro 


Asn 


Ser 


Asp 




210 






Met 


Glu 


Leu 


Val 


225 








Gin 


Trp 


He 


Gin 


Ser 


Asn 


Ser 


Ala 








260 


He 


Met 


Ser 


Leu 






275 




Pro 


Val 


Glu 


Asp 




290 






Asn 


Gly 


Tyr 


Asp 


305 








Thr 


Lys 


Lys 


Phe 



Tyr 


Glu 


He 


Val 


5 

Gly 


He 


Ser 


Asp 


Leu 


Pro 


Pro 


Asp 








40 


Leu 


Thr 


Val 


Ala 






55 




Arg 


Val 


Ser 


Lys 




70 






Gly 


Glu 


Ser 


Tyr 


85 








Lys 


Ser 


Met 


Val 


He 


Gin 


Arg 


He 








120 


Ala 


Val 


Thr 


Lys 






135 




Asp 


Glu 


Cys 


Tyr 




150 






Leu 


Gin 


Trp 


Ala 


165 








Asn 


Leu 


Thr 


Glu 


Ser 


Gin 


Thr 


Gin 








200 


Ala 


Pro 


Val 


He 






215 




Gly 


Trp 


Leu 


Val 




230 






Glu 


Asp 


Gin 


Ala 


245 








Ser 


Gin 


He 


Lys 


Thr 


Lys 


Thr 


Ala 






280 


He 


Ser 


Ser 


Asn 






295 




Pro 


Gin 


Tyr 


Ala 




310 






Gly 


Lys 


Arg 


Asn 



He 


Lys 


Val 


Pro 




10 






Ser 


Phe 


Val 


Asn 


25 








Ser 


Asp 


Met 


Asp 


Glu 


Lys 


Leu 


Gin 






60 


Ala 


Pro 


Glu 


Ala 






75 




Phe 


His 


Met 


His 




90 






Leu 


Gly 


Arg 


Phe 


105 








Tyr 


Arg 


Gly 


He 


Thr 


Arg 


Asn Gly 








140 


He 


Pro 


Asn 


Tyr 






155 




Trp 


Thr 


Asn 


Met 


170 






Arg 


Lys 


Arg 


Leu 


185 








Glu 


Gin 


Asn 


Lys 


Arg 


Ser 


Lys 


Thr 








220 


Asp 


Lys 


Gly 


He 






235 




Ser 


Tyr 


He 


Ser 




250 






Ala 


Ala 


Leu Asp 


265 








Pro 


Asp 


Tyr 


Leu 


Arg 


He 


Tyr 


Lys 








300 


Ala 


Ser 


Val 


Phe 






315 




Thr 


He 


Trp 


Leu 



Ser Asp Leu Asp 
15 

Trp Val Ala Glu 
30 

Leu Asn Leu He 
45 

Arg Asp Phe Leu 

Leu Phe Phe Val 

80 

Val Leu Val Glu 
95 

Leu Ser Gin He 
110 

Glu Pro Thr Leu 
125 

Ala Gly Gly Gly 

Leu Leu Pro Lys 

160 

Glu Gin Tyr Leu 
175 

Val Ala Gin His 
190 

Glu Asn Gin Asn 
205 

Ser Ala Arg Tyr 

Thr Ser Glu Lys 

240 

Phe Asn Ala Ala 
255 

Asn Ala Gly Lys 
270 

Val Gly Gin Gin 
285 

He Leu Glu Leu 

Leu Gly Trp Ala 

320 

Phe Gly Pro Ala 
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325 



Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 






340 










Phe 


Tyr 


Gly 


Cys 


Val 


Asn Trp 


Thr 




355 










360 


Cys 


Val 
370 


Asp 


Lys 


Met 


Val 


He 
375 


Trp 


Lys 


Val 


Val 


Glu Ser Ala Lys 


Ala 


385 










390 






Val 


Asp 


Gin 


Lys 


Cys 
405 


Lys 


Ser 


Ser 


He 


Val 


Thr 


Ser 
420 


Asn 


Thr 


Asn 


Met 


Thr 


Thr 


Phe 
435 


Glu 


His 


Gin 


Gin 


Pro 
440


Glu 


Leu 


Thr 


Arg Arg 


Leu 


Asp 


His 




450 










455 




Glu 


Val 


Lys 


Asp 


Phe 


Phe 


Arg 


Trp 


465 










470 






Glu 


His 


Glu 


Phe 


Tyr 
485 


Val 


Lys 


Lys 


Pro 


Ser 


Asp 


Ala Asp 


He 


Ser 


Glu 








500 










Ala 


Gin 


Pro 


Ser 


Thr 


Ser Asp 


Ala 






515 










520 


Arg 


Tyr 
530 


Gin 


Asn 


Lys 


Cys 


Ser 
535 


Arg 


Phe 


Pro 


Cys 


Arg Gin Cys 


Glu 


Arg 


545 










550 






Phe 


Thr 


His 


Gly Gin Lys 


Asp 


Cys 










565 








Ser 


Gin 


Pro 


Val 
580 


Ser 


Val 


Val 


Lys 


He 


His 


His 
595 


He 


Met 


Gly 


Lys 


Val 
600 


Leu 


Val 
610 


Asn 


Val 


Asp 


Leu 


Asp 
615 


Asp 



<210> 78 
<211> 397 
<212> PRT

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 



<400> 78 



Met 


Glu 


Leu 


Val 


Gly 


Trp 


Leu 


Val 


1 








5 








Gin 


Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ala 








20 










Ser 


Asn 


Ser 


Ala 


Ser 


Gin 


He 


Lys 






35 










40 


He 


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 




50 










55 




Pro 


Val 


Glu 


Asp 


He 


Ser 


Ser 


Asn 


65 










70 






Asn Gly 


Tyr 


Asp 


Pro 


Gin 


Tyr 


Ala 










85 








Thr 


Lys 


Lys 


Phe 


Gly 


Lys 


Arg 


Asn 



100 








330 










335 




Glu 


Ala 


He 


Ala 


His 


Thr 


Val 


Pro 


345 










350 






Asn 


Glu 


Asn 


Phe 


Pro 
365 


Phe 


Asn 


Asp 


Trp 


Glu 


Glu Gly Lys 


Met 


Thr 


Ala 








380 










He 


Leu 


Gly Gly Ser 


Lys 


Val 


Arg 






395 










400 


Ala 


Gin 
410 


He 


Asp 


Pro 


Thr 


Pro 
415 


Val 


Cys 


Ala 


Val 


He 


Asp 


Gly Asn 


Ser 


425 










430 






Leu 


Gin 


Asp 


Arg 


Met 
445 


Phe 


Lys 


Phe 


Asp 


Phe 


Gly 


Lys 


Val 


Thr 


Lys 


Gin 






460 










Ala 


Lys 


Asp 
475 


His 


Val 


Val 


Glu 


Val 
480 


Gly Gly 


Ala 


Lys 


Lys 


Arg 


Pro 


Ala 




490 










495 




Pro 


Lys 


Arg Val Arg 


Glu 


Ser 


Val 


505 










510 






Glu 


Ala 


Ser 


He 


Asn 


Tyr Ala 


Asp 










525 








His 


Val 


Gly 


Met 
540


Asn 


Leu 


Met 


Leu 


Met 


Asn 


Gin 
555 


Asn 


Ser 


Asn 


He 


Cys 
560 


Leu 


Glu 


Cys 


Phe 


Pro 


Val 


Ser 


Glu 




570 








575 




Lys 


Ala 


Tyr 


Gin 


Lys 


Leu 


Cys 


Tyr 


585 










590 






Pro Asp 


Ala 


Cys 


Thr 


Ala 


Cys 


Asp 










605 








Cys 


He 


Phe 


Glu 


Gin 












620 











260 


GCG 










Asp 


Lys 


Gly He Thr 


Ser Glu 


Lys 




10 






15 




Ser Tyr 


He Ser 


Phe 


Asn Ala 


Ala 


25 








30 




Ala 


Ala 


Leu Asp 


Asn 


Ala Gly 


Lys 






45 






Pro Asp 


Tyr Leu 


Val 


Gly Gin 


Gin 






60 








Arg 


He 


Tyr Lys 


He 


Leu Glu 


Leu 






75 






80 


Ala 


Ser 


Val Phe 


Leu 


Gly Trp 


Ala 




90 






95 




Thr 


He 


Trp Leu 


Phe 


Gly Pro 


Ala 


105 






110 
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Thr 


Thr 


Gly 


Lys 






115 




Phe 


Tyr 


Gly 


Cys 




^ *s n 

130 






Cys 


Val 


Asp 


Lys 


145 








Lys 


Val 


Val 


Glu 


Val 


Asp 


Gin 


Lys 






180 


He 


Val 


Thr 


Ser 






195 




Thr 


Thr 


Phe 


Glu 




210 






Glu 


Leu 


Thr 


Arg 


225 








Glu 


Val 


Lys 


Asp 


Glu 


His 


Glu 


Phe 








260 


Pro 


Ser 


Asp 


Ala 






275 




Ala 


Gin 


Pro 


Ser 




290 






Arg 


Tyr 


Gin 


Asn 


305 








Phe 


Pro 


Cys 


Arg 


Phe 


Thr 


His 


Gly 








340 


Ser 


Gin 


Pro 


Val 






355 




He 


His 


His 


He 




370 






Leu 


Val 


Asn 


Val 


385 









Thr




Tip 

Ile


Ala








120 


Val




Trp


Thr 






135 




Met 


Val


Ile


Trp




150






Ser 


Ala 


Lys 




165 








Cys 


Lys 


Ser 


Ser 


Asn 


Thr 


Asn 


Met 










His 


Gin 


Gin 


Pro 






215




Arg 


Leu 


Asp 


His 




230 






Phe 


Phe 


Arg 


Trp 


245 








Tyr 


Val 


Lys 


Lys 


Asp 


He 


Ser 


Glu 






280 


Thr 


Ser 


Asp 


Ala 






295 




Lys 


Cys 


Ser 


Arg 




310 






Gin 


Cys 


Glu 


Arg 


325 








Gin 


Lys 


Asp 


Cys 


Ser 


Val 


Val 


Lys 








360 


Met 


Gly 


Lys 


Val 






375 




Asp 


Leu 


Asp 


Asp 




390 







Glu Ala He Ala 

Asn Glu Asn Phe 

140 

Trp Glu Glu Gly 
155 

He Leu Gly Gly 
170 

Ala Gin He Asp 
185 

Cys Ala Val He 

Leu Gin Asp Arg 

220 

Asp Phe Gly Lys 
235 

Ala Lys Asp His 
250 

Gly Gly Ala Lys 
265 

Pro Lys Arg Val 

Glu Ala Ser He 

300 

His Val Gly Met 
315 

Met Asn Gin Asn 
330 

Leu Glu Cys Phe 
345 

Lys Ala Tyr Gin 

Pro Asp Ala Cys 

380 

Cys He Phe Glu 
395 



His Thr Val Pro 
125 

Pro Phe Asn Asp 

Lys Met Thr Ala 

160 

Ser Lys Val Arg 
175 

Pro Thr Pro Val 
190 

Asp Gly Asn Ser 
205 

Met Phe Lys Phe 

Val Thr Lys Gin 

240 

Val Val Glu Val 
255 

Lys Arg Pro Ala 
270 

Arg Glu Ser Val 
285 

Asn Tyr Ala Asp 

Asn Leu Met Leu 

320 

Ser Asn He Cys 
335 

Pro Val Ser Glu 
350 

Lys Leu Cys Tyr 
365 

Thr Ala Cys Asp 
Gin 



<210> 79 

<211> 536 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Mutant rep protein: rep68 260 GCG 



<400> 79 



Thr 


Ala 


Gly 


Phe 


1 
Glu 


His 


Leu 


Pro 








20 


Lys 


Glu 


Trp 


Glu 




35 




Glu 


Gin 


Ala 


Pro 




50 






Thr 


Glu 


Trp 


Arg 


65 








Gin 


Phe 


Glu 


Lys 


Thr 


Thr 


Gly 


Val 








100 


Arg 


Glu 


Lys 


Leu 



Tyr 


Glu 


He 


Val 


5 








Gly 


He 


Ser 


Asp 


Leu 


Pro 


Pro 


Asp 








40 


Leu 


Thr 


Val 


Ala 






55 




Arg 


Val 


Ser 


Lys 




70 






Gly 


Glu 


Ser 


Tyr 


85 








Lys 


Ser 


Met 


Val 


He 


Gin 


Arg 


He 



He Lys Val Pro 
10 

Ser Phe Val Asn 
25 

Ser Asp Met Asp 

Glu Lys Leu Gin 

60 

Ala Pro Glu Ala 
75 

Phe His Met His 
90 

Leu Gly Arg Phe 
105 

Tyr Arg Gly He 



Ser Asp Leu Asp 
15 

Trp Val Ala Glu 
30 

Leu Asn Leu He 
45 

Arg Asp Phe Leu 

Leu Phe Phe Val 

80 

Val Leu Val Glu 
95 

Leu Ser Gin He 
110 

Glu Pro Thr Leu 
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115 








120 


Pro


Asn


Trp Phe 


Ala 


Val 


Thr 


Lys




130 






135 




"Ti ~n 
HSU 


Lys


Val Val 


Asp 


Glu 


Cys 


Tyr 


145







150 






iax 




Pro Glu 


Leu 


Gin 


Trp 


Ala 








165 








Ser 


Ala


Cys Leu 


Asn 


Leu 


Thr 


Glu 






180 










Leu 


Thr


His Val 


Ser 


Gin 


Thr 


Gin 






195 








200 


Pro 


Asn 


Ser Asp 


Ala 


Pro 


Val 


He 




210






215 




Met 


Glu


Leu Val 


Gly Trp 


Leu 


Val 


225








230 






Gin 


Trp 


Ile Gln Glu Asp


Gin 


Ala 






245 








Ser 


Asn 


Ser Ala 


Ser 


Gin 


He 


Lys 






260 










Ile


Met 


Ser Leu 


Thr 


Lys 


Thr 


Ala 






275 








280 


Pro 


Val 


Glu Asp 


He 


Ser 


Ser 


Asn 




290






295 




Asn 


Gly 


Tyr Asp 


Pro 


Gin 


Tyr Ala 


305






310 






Thr 


Lys 


Lys Phe 


Gly Lys 


Arg Asn 








325 








Thr 


Thr


Gly Lys 


Thr 


Asn 


He 


Ala 






340 











Phe


Tyr


Gly Cys 


Val 


Asn 


Trp 


Thr 




355 








360 


Cys


Val


Asp Lys 


Met 


Val 


He 


Trp 


370 








375 




Lys 


Val 


Val Glu 


Ser 


Ala 


Lys 


Ala 


385 








390 






Val 


Asp 


Gin Lys 


Cys 


Lys 


Ser 


Ser 






405 








He 


Val 


Thr Ser 


Asn 


Thr 


Asn 


Met 






420 










Thr 


Thr 


Phe Glu 


His 


Gin 


Gin 


Pro 






435 








440 


Glu 


Leu 


Thr Arg Arg 


Leu 


Asp 


His 




450 








455 




Glu 


Val 


Lys Asp 


Phe 


Phe 


Arg 


Trp 


465 








470 






Glu 


His 


Glu Phe 


Tyr 


Val 


Lys 


Lys 








485 








Pro 


Ser 


Asp Ala Asp 


He 


Ser 


Glu 






500 










Ala 


Gin 


Pro Ser 


Thr 


Ser 


Asp 


Ala 






515 








520 


Arg 


Leu 


Ala Arg Gly His 


Ser 


Leu 


530 








535 





<210> 80 
<211> 312 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 



125 



Thr 


Arg 


Asn Gly 


Ala 


Gly 


Gly 


Gly 






140 










He 


Pro 


Asn Tyr 


Leu 


Leu 


Pro 


Lys 






155 








160 




Thr 


Asn Met 


Glu 


Gin 


Tyr 


Leu 


170 








175 




Arg


Lys


Arg Leu


Val 


Ala 


Gin 


His 


185 








190 






Glu 


Gin 


Asn Lys 


Glu 


Asn 


Gin 


Asn 






205 












Lys Thr


Ser 


Ala 


Arg 


Tyr 




220 

M «U W 












T ,\rci 
j_j y o 


Gly Ile


Thr 

*1» All 


Ser 


Glu 


Lys 


235 








240 


Ser




Ile Ser


Phe 


Asn 


Ala 


Ala 




250 

J v 








255 




Ala 


Ala 


Leu Asp 


Asn 


Ala 


Gly 


Lys 


265








270 






Pro 


Asp


Tyr Leu


Val 


Gly 


Gin 


Gin 








285 








Arg


He 


Tyr Lys


He 


Leu 


Glu 


Leu 






300 










Ala 


Ser 

ttw.1 Jh* 


Val Phe 


Leu 


Gly 


Trp 


Ala 






315 








320 


Thr 


He 


Trp Leu


Phe 


Gly 


Pro 


Ala 




330 








335 




Glu 


Ala 


He Ala 


His 


Thr 


Val 


Pro 


345 








350 






Asn 


Glu 


Asn Phe 


Pro 


Phe 


Asn 


Asp 








365 








Trp 


Glu 


Glu Gly 


Lys 


Met 


Thr 


Ala 




380 










He 


Leu 


Gly Gly 


Ser 


Lys 


Val 


Arg 






395 








400 


Ala 


Gin 


He Asp 


Pro 


Thr 


Pro 


Val 




410 








415 




Cys 


Ala 


Val He 


Asp 


Gly 


Asn 


Ser 


425 








430 






Leu 


Gin 


Asp Arg 


Met 


Phe 


Lys 


Phe 






445 








Asp 


Phe 


Gly Lys 


Val 


Thr 


Lys 


Gin 






460 










Ala 


Lys 


Asp His 


Val 


Val 


Glu 


Val 




475 








480 


Gly 


Gly 


Ala Lys 


Lys 


Arg 


Pro 


Ala 


490 








495 




Pro 


Lys 


Arg Val 


Arg 


Glu 


Ser 


Val 


505 






510 






Glu 


Ala 


Ser He 


Asn 


Tyr 


Ala 


Asp 



525 



260 GCG 
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<400> 80 





bill 


Leu 


Val 




Ttt> 


Leu Val 


1 








5




f*1 ft Al a 
vjXIl Ala 


Gin 


Trp 


Ile


Gin 


Glu


Asp






20 








Ser 


Asn 


Ser 


Ala 


Ser 


Gln


Ile Lys






35 










Ile


Met 


Ser 


Leu 


Thr 


Lys


Thr Ala




50 










55


Pro 


Val 


Glu Asp 


Ile




Ser Asn


65 










70 




Asn


Gly 


Tyr Asp 


Pro 


Gin 


Tyr Ala








85 






Thr 


Lys 


Lys 


Phe 


Gly 


Lys 


Arg Asn 








100 








Thr


Thr 


Gly Lys 


Thr 


Asn 


Ile Ala






115 








120 


Phe 


Tyr 


Gly Cys 


Val 


Asn 


Trp Thr




130 










135 


Cys 


Val 


Asp 


Lys 


Met 


Val 


Ile Trp


145 










150 




Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys Ala 








165 






Val 


Asp 


Gin Lys 


Cys 


Lys 


Ser Ser 








180 








Ile


Val 


Thr 


Ser 


Asn 


Thr 


Asn Met 






195 








200 


Thr 


Thr 


Phe 


Glu 


His 


Gin 


Gin Pro 




210 










215 


Glu 


Leu 


Thr 


Arg 


Arg 


Leu 


Asp His 


225 










230 




Glu 


Val 


Lys 


Asp 


Phe 


Phe 


Arg Trp 










245 






Glu 


His 


Glu 


Phe 


Tyr 


Val 


Lys Lys 








260 








Pro 


Ser 


Asp 


Ala 


Asp 


Ile


Ser Glu 






275 








280 


Ala 


Gin 


Pro 


Ser 


Thr 


Ser 


Asp Ala 




290 










295 


Arg 


Leu 


Ala Arg 


Gly His 


Ser Leu 


305 










310 





Asp 


Lys 


Gly 


He 


Thr 


Ser Glu Lys 




10 








15 


Ser 


Tyr 


He 


Ser 


Phe 


Asn Ala Ala 


25 








30 


Ala 


Ala 


Leu Asp 


Asn Ala Gly Lys 










45 




Pro Asp 


Tyr 


Leu 


Val 


Gly Gin Gin 








60 






Arg 


He 


Tyr 


Lys 


He 


Leu Glu Leu 






75 






80 


Ala 


Ser 


Val 


Phe 


Leu Gly Trp Ala 




90 








95 


Thr 


He 


Trp 


Leu 


Phe Gly Pro Ala 


105 








110 


Glu 


Ala 


He 


Ala 


His 


Thr Val Pro 










125 




Asn 


Glu 


Asn


Phe 


Pro 


Phe Asn Asp 








140 






Trp Glu 


Glu Gly 


Lys 


Met Thr Ala 






155 






160 


He 


Leu 


Gly 


Gly 


Ser 


Lys Val Arg 




170 






175


Ala 


Gin 


He 


Asp 


Pro 


Thr Pro Val 


185 








190 


Cys 


Ala 


Val 


He 


Asp 


Gly Asn Ser 








205 




Leu 


Gin 


Asp 


Arg 


Met 


Phe Lys Phe 






220 






Asp 


Phe 


Gly 


Lys 


Val 


Thr Lys Gin 




235 






240 


Ala 


Lys 


Asp 


His 


Val 


Val Glu Val 




250 






255 


Gly Gly 


Ala 


Lys 


Lys 


Arg Pro Ala 


265 










270 


Pro 


Lys 


Arg 


Val 


Arg 


Glu Ser Val 






285 




Glu 


Ala 


Ser 


He 


Asn Tyr Ala Asp 








300 







<210> 81 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 263 GCC 



<400> 81 

Thr Ala Gly Phe Tyr Glu Ile Val

1 5 
Glu His Leu Pro Gly Ile Ser Asp

20 

Lys Glu Trp Glu Leu Pro Pro Asp 

35 40 
Glu Gln Ala Pro Leu Thr Val Ala

50 55 
Thr Glu Trp Arg Arg Val Ser Lys 
65 70 



He 


Lys 
10 


Val 


Pro 


Ser Asp 


Leu 
15 


Asp 


Ser 


Phe 


Val 


Asn 


Trp Val 


Ala 


Glu 


25 








30 






Ser 


Asp 


Met 


Asp 


Leu Asn 


Leu 


He 








45 






Glu 


Lys 


Leu 


Gin 


Arg Asp 


Phe 


Leu 






60 








Ala 


Pro 


Glu 
75 


Ala 


Leu Phe 


Phe 


Val 
80 



WO 03/018820 



PCT/EB02/04087 






Gin 


Phe 


Glu


Lys Gly 


Glu 


Ser 


Tyr










85 








Thr 


Thr 


Gly 


Val 


Lys 


Ser 


Met 


Val 








100 










Arg Glu 


Lys 


Leu 


He 


Gin Arg 


Ile






115 










120


Pro 


Asn 


Trp 


Phe 


Ala 


Val 


Thr 


Lys




130 








135 




Asn 


Lys 


Val 


Val 


Asp 


Glu 


Cys 


Tyr 


145 








150 






Thr 


Gin 


Pro 


Glu 


Leu 


Gin 


Trp 


Ala 










165 








Ser 


Ala 


Cys 


Leu 


Asn 


Leu 


Thr 


Glu 






180 










Leu 


Thr 


His 


Val 


Ser 


Gin 


Thr 


Gin 






195 











200 


Pro 


Asn 


Ser 


Asp Ala 


Pro 


Val 


He 




210 










215 




Met 


Glu 


Leu 


Val 


Gly 


Trp 


Leu 


Val 


225 










230 






Gin Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ala 










245 








Ser 


Asn 


Ser 


Arg 


Ser 


Gin 


Ala 


Lys 








260 










Ile


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 






275 











280 


Pro 


Val 


Glu 


Asp 


He 


Ser 


Ser 


"ii _ — 

Asn 




290 










295 




Asn Gly 


Tyr 


Asp 


Pro 


Gin 


Tyr 


Ala 


305 










310 






Thr 


Lys 


Lys 


Phe 


Gly 


Lys 


Arg 


Asn 








325 








Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 








340 










Phe 


Tyr 


Gly 


Cys 


Val 



Asn 


Trp 


Thr 




355 










360 


Cys 


Val 



Asp 


Lys 


Met 


Val 


He 


Trp 


370 










375 




Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 


385 










390 






Val 


Asp 


Gin 


Lys 


Cys 


Lys 


Ser 


Ser 










405 








He 


Val 


Thr 


Ser 


Asn 


Thr 


Asn 


Met 








420 










Thr 


Thr 


Phe 


Glu 


His 


Gin 


Gin 


Pro 






435 










440 


Glu 


Leu 


Thr 


Arg 


Arg 


Leu 


Asp 


His 




450 










455 




Glu 


Val 


Lys 


Asp 


Phe 


Phe 


Arg Trp 


465 










470 






Glu 


His 


Glu 


Phe 


Tyr 


Val 


Lys 


Lys 










485 








Pro 


Ser 


Asp 


Ala 


Asp 


He 


Ser 


Glu 








500 










Ala 


Gin 


Pro 


Ser 


Thr 


Ser 


Asp 


Ala 






515 










520 


Arg 


Tyr 


Gin 


Asn Lys 


Cys 


Ser 


Arg 


530 










535 




Phe 


Pro 


Cys 


Arg 


Gin 


Cys 


Glu Arg 


545 








550 






Phe 


Thr 


His 


Gly Gin 


Lys 


Asp 


Cys 



Phe 


His 


Met 


His Val Leu 


Val 


Glu 




90 






95 




Leu Gly 


Arg 


Phe Leu Ser 


Gin 


He 


105 






110 






Tyr Arg 


Gly 


He Glu Pro 


Thr 


Leu








125 






Thr Arg 


Asn Gly Ala Gly Gly 


Gly








140 






He 


Pro 


Asn 


Tyr Leu Leu 


Pro 


Lys






155 






160


Trp 


Thr 


Asn 


Met Glu Gin 


Tyr 


Leu


170 






175 




Arg 


Lys 


Arg 


Leu Val Ala 


Gin 


His


185 






190 






Glu 


Gin 


Asn 


Lys Glu Asn 


Gin 


Asn








205 






Arg 


Ser 


Lys 


Thr Ser Ala Arg 


Tyr






220 






Asp Lys 


Gly 


He Thr Ser 


Glu 


Lys






235 








Ser 


Tyr 


He 


Ser Phe Asn 


Ala 


Ala




250 






255 




Ala 


Ala 


Leu Asp Asn Ala Gly 


Lys


265 






270 






Pro 


Asp 


Tyr Leu Val Gly Gin 


Gln






285 






Arg 


He 


Tyr 


Lys He Leu 


Glu 


Leu








300 






Ala 


Ser 


Val 


Phe Leu Gly Trp 


Ala 






315 






320 


Thr 


He 


Trp 


Leu Phe Gly 


Pro 


Ala 




330 




335 




Glu 


Ala 


He 


Ala His Thr 


Val 


Pro 


345 






350 






Asn 


Glu 


Asn 


Phe Pro Phe 


Asn 










365 






Trp 


Glu 


Glu Gly Lys Met 


Thr 


Ala 








380 






He 


Leu 


Gly Gly Ser Lys 


Val 








395 






400 


Ala 


Gin 


He Asp Pro Thr 


Pro 


Val 




410 






415 




Cys Ala 


Val 


He Asp Gly Asn 


Ser 


425 






430 






Leu 


Gin 


Asp 


Arg Met Phe 


Lys 


Phe 






445 






Asp 


Phe 


Gly Lys Val Thr 


Lys 


Gin 








460 






Ala Lys 


Asp 


His Val Val 


Glu 


Val 






475 






480 


Gly Gly 


Ala 


Lys Lys Arg 


Pro 


Ala 




490 






495 




Pro 


Lys 


Arg 


Val Arg Glu 


Ser 


Val 


505 




510 






Glu 


Ala 


Ser 


He Asn Tyr Ala 


Asp 








525 






His 


Val 


Gly Met Asn Leu Met 


Leu 








540 






Met 


Asn 


Gin 


Asn Ser Asn 


He 


Cys 






555 






560 


Leu 


Glu 


Cys 


Phe Pro Val 


Ser 


Glu 
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Ser Gin Pro Val 

580 

Ile His His Ile
595 

Leu Val Asn Val 
610 



565 

Ser Val Val Lys 

Met Gly Lys Val 

600 

Asp Leu Asp Asp 
615 



570 

Lys Ala Tyr Gin 
585 

Pro Asp Ala Cys 

Cys He Phe Glu 

620 



575 

Lys Leu Cys Tyr 
590 

Thr Ala Cys Asp 

605 

Gin 



<210> 82 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 263 GCC 



<400> 82 




Met 


Glu 


Leu 


Val 


1 








Gln


Trp


Ile


Gin 








Ser 


Asn 


Ser 


Arg 






35 




He 


Met 


Ser 


Leu 




50 






Pro 


Val 


Glu 


Asp 


65 








Asn 


Gly 


Tyr 


Asp 


Thr 


Lys 


Lys 


Phe 








100 


Thr 


Thr 


Gly 


Lys 






115 




Phe 


Tyr 


Gly 


Cys 




130 






Cys 


Val 


Asp 


Lys 


145 








Lys 


Val 


Val 


Glu 


Val 


Asp 


Gin 


Lys 








180 


He 


Val 


Thr 


Ser 






195 




Thr 


Thr 


Phe 


Glu 




210 






Glu 


Leu 


Thr 


Arg 


225 








Glu 


Val 


Lys 


Asp 


Glu 


His 


Glu 


Phe 








260 


Pro 


Ser 


Asp 


Ala 






275 




Ala 


Gin 


Pro 


Ser 




290 






Arg 


Tyr 


Gin 


Asn 


305 








Phe 


Pro 


Cys 


Arg 


Phe 


Thr 


His 


Gly 








340 



Gly 


Trp 


Leu 


Val 


5 








Glu 


Asp 


Gin 


Ala 


Ser 


Gin 


Ala 


Lys 








40 


Thr 


Lys 


Thr 


Ala 




55 




He 


Ser 


Ser 


Asn 




70 






Pro 


Gin 


Tyr 


Ala 


85 








Gly 


Lys 


Arg 


Asn 


Thr 


Asn 


He 


Ala 








120 


Val 


Asn 


Trp 


Thr 






135 




Met 


Val 


He 


Trp 




150 






Ser 


Ala 


Lys 


Ala 


165 








Cys 


Lys 


Ser 


Ser 


Asn 


Thr 


Asn 


Met 








200 


His 


Gin 


Gin 


Pro 






215 




Arg 


Leu 


Asp 


His 




230 






Phe 


Phe 


Arg 


Trp 


245 








Tyr 


Val 


Lys 


Lys 


Asp 


He 


Ser 


Glu 






280 


Thr 


Ser 


Asp 


Ala 






295 




Lys 


Cys 


Ser 


Arg 




310 






Gin 


Cys 


Glu 


Arg 


325 








Gin 


Lys 


Asp 


Cys 



Asp Lys Gly He 
10 

Ser Tyr He Ser 
25 

Ala Ala Leu Asp 

Pro Asp Tyr Leu 

60 

Arg He Tyr Lys 
75 

Ala Ser Val Phe 
90 

Thr He Trp Leu 
105 

Glu Ala He Ala 

Asn Glu Asn Phe 

140 

Trp Glu Glu Gly 
155 

He Leu Gly Gly 
170 

Ala Gin He Asp 
185 

Cys Ala Val He 

Leu Gin Asp Arg 

220 

Asp Phe Gly Lys 
235 

Ala Lys Asp His 
250 

Gly Gly Ala Lys 
265 

Pro Lys Arg Val 

Glu Ala Ser He 

300 

His Val Gly Met 
315 

Met Asn Gin Asn 
330 

Leu Glu Cys Phe 
345 



Thr Ser Glu Lys 
15 

Phe Asn Ala Ala 
30 

Asn Ala Gly Lys 
45 

Val Gly Gin Gin 

He Leu Glu Leu 

80 

Leu Gly Trp Ala 
95 

Phe Gly Pro Ala 
110 

His Thr Val Pro 
125 

Pro Phe Asn Asp 

Lys Met Thr Ala 

160 

Ser Lys Val Arg 
175 

Pro Thr Pro Val 
190 

Asp Gly Asn Ser 
205 

Met Phe Lys Phe 

Val Thr Lys Gin 

240 

Val Val Glu Val 
255 

Lys Arg Pro Ala 
270 

Arg Glu Ser Val 
285 

Asn Tyr Ala Asp 

Asn Leu Met Leu 

320 

Ser Asn He Cys 
335 

Pro Val Ser Glu 
350 
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Ser Gin Pro Val 
355 

Ile His His Ile
370 

Leu Val Asn Val 
385 



Ser Val Val Lys 

360 

Met Gly Lys Val 
375 

Asp Leu Asp Asp 
390 



Lys Ala Tyr Gin 

Pro Asp Ala Cys 

380 

Cys He Phe Glu 
395 



Lys Leu Cys Tyr 
365 

Thr Ala Cys Asp 
Gin 



<210> 83 
<211> 536 
<212> PRT 
<213> Artificial Sequence



<220> 

<223> Mutant rep protein: rep68 263 GCC 



<400> 83 



Thr 


Ala 


Gly


Phe


1








Glu



His


Leu 


Pro








20 


Lys 


Glu 


Trp 


Glu 










Glu


Gin 


Ala 


Pro 










Thr 


Glu 


Trp 


Arg 


65 








Gin 


Phe 


Glu 


Lys 


Thr 


Thr 


Gly 


Val 








100 


Arg 


Glu 


Lys 


Leu 






115 




Pro 


Asn 


Trp 


Phe 




130 






Asn 


Lys 


Val 


Val 


145 








Thr 


Gin 


Pro 


Glu 


Ser 


Ala 


Cys 


Leu








180 


Leu 


Thr 


His 


Val 






195 




Pro 


Asn 


Ser 


Asp 




210 






Met 


Glu 


Leu 


Val 


225 








Gin 


Trp 


He 


Gin 


Ser 


Asn 


Ser 


Arg 








260 


Ile


Met 


Ser 


Leu 






275 




Pro 


Val 


Glu 


Asp 




290 






Asn 


Gly 


Tyr 


Asp 


305 








Thr 


Lys 


Lys 


Phe 


Thr 


Thr 


Gly 


Lys 








340 


Phe 


Tyr 


Gly 


Cys 



Tyr


Glu






5










Tip 

Ile


Ser




Leu


Pro


Pro










40 


Leu 


Thr 


Val 


Ala 










Arg 


Val 


Ser 


Lys 




70 






Gly 


Glu 


Ser 


Tyr 


85 








Lys 


Ser 


Met 


Val 


He 


Gin 


Arg 


He 








120 


Ala 


Val 


Thr 


Lys 






135 




Asp 


Glu 


Cys 


Tyr 




150 






Leu 


Gin 


Trp 


Ala 


165 








Asn 


Leu 


Thr 


Glu 


Ser 


Gin 


Thr 


Gin 








200 


Ala 


Pro 


Val 


He 






215 




Gly 


Trp 


Leu 


Val 




230 






Glu 


Asp 


Gin 


Ala 


245 








Ser 


Gin 


Ala 


Lys 


Thr 


Lys 


Thr 


Ala 








280 


He 


Ser 


Ser 


Asn 






295 




Pro 


Gin 


Tyr 


Ala 




310 






Gly 


Lys 


Arg 


Asn 


325 








Thr 


Asn 


He 


Ala 


Val 


Asn 


Trp 


Thr 



He Lys Val Pro 
10 

Ser Phe Val Asn 
25 

Ser Asp Met Asp 

Glu Lys Leu Gin 

60 

Ala Pro Glu Ala 
75 

Phe His Met His 
90 

Leu Gly Arg Phe 
105 

Tyr Arg Gly Ile

Thr Arg Asn Gly 

140 

He Pro Asn Tyr 
155 

Trp Thr Asn Met 
170 

Arg Lys Arg Leu 
185 

Glu Gin Asn Lys 

Arg Ser Lys Thr 

220 

Asp Lys Gly He 
235 

Ser Tyr He Ser 
250 

Ala Ala Leu Asp 
265 

Pro Asp Tyr Leu 

Arg He Tyr Lys 

300 

Ala Ser Val Phe 
315 

Thr He Trp Leu 
330 

Glu Ala He Ala 
345 

Asn Glu Asn Phe 



Ser Asp Leu Asp 
15 

Trp Val Ala Glu 
30 

Leu Asn Leu He 
45 

Arg Asp Phe Leu 

Leu Phe Phe Val 

80 

Val Leu Val Glu 
95 

Leu Ser Gin He 
110 

Glu Pro Thr Leu 
125 

Ala Gly Gly Gly 

Leu Leu Pro Lys 

160 

Glu Gin Tyr Leu 
175 

Val Ala Gin His 
190 

Glu Asn Gin Asn 
205 

Ser Ala Arg Tyr 

Thr Ser Glu Lys 

240 

Phe Asn Ala Ala 
255 

Asn Ala Gly Lys 
270 

Val Gly Gin Gin 
285 

He Leu Glu Leu 

Leu Gly Trp Ala 

320 

Phe Gly Pro Ala 
335 

His Thr Val Pro 
350 

Pro Phe Asn Asp 
355






360 


fire 

\~y 9 


v dl 




Lys Met Val 


Ile


Trp


370 






375 




Lys 


Val 


Val 


Glu Ser Ala 


Lys 


Ala 


385 






390 






Val 


Asp 


Gin 


Lys Cys Lys 


Ser 


Ser 






405 






He 


Val 


Thr 


Ser Asn Thr 


Asn 


Met 








420 






Thr 


Thr 


Phe 


Glu His Gin 


Gin 


Pro 






435 






440 


Glu 


Leu 


Thr 


Arg Arg Leu Asp 


His 




450 






455 




Glu 


Val 


Lys 


Asp Phe Phe 


Arg 


Trp 


465 




470 






Glu 


His 


Glu 


Phe Tyr Val Lys 


Lys 








485 






Pro 


Ser 


Asp 


Ala Asp He 


Ser 


Glu 








500 






Ala 


Gin 


Pro 


Ser Thr Ser 


Asp 


Ala 






515 






520 


Arg 


Leu 


Ala 


Arg Gly His 


Ser 


Leu 


530 






535 





<210> 84 
<211> 312 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 



<400> 84 



Met 


Glu 


Leu 


Val 


Gly 


Trp 


Leu 


Val 


1 








5 








Gin 


Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ala 






20 










Ser 


Asn 


Ser 
35 


Arg 


Ser 


Gin 


Ala 


Lys 
40 


He 


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 




50 








55 




Pro 


Val 


Glu 


Asp 


He 


Ser 


Ser 


Asn 


65 










70 






Asn 


Gly 


Tyr 


Asp 


Pro 
85 


Gin 


Tyr 


Ala 


Thr 


Lys 


Lys 


Phe 
100 


Gly 


Lys 


Arg 


Asn 


Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 






115 








120 


Phe 


Tyr 
130 


Gly 


Cys 


Val 


Asn 


Trp 
135 


Thr 


Cys 


Val 


Asp 


Lys 


Met 


Val 


He 


Trp 


145 










150 






Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 








165 








Val 


Asp 


Gin 


Lys 
180 


Cys 


Lys 


Ser 


Ser 


He 


Val 


Thr 
195 


Ser 


Asn 


Thr 


Asn 


Met 
200 


Thr 


Thr 


Phe 


Glu 


His 


Gin 


Gin 


Pro 
365








Trp 


Glu 


Glu Gly 


Lys 


Met 


Thr 


Ala 




380 










He 


Leu 


Gly Gly 


Ser 


Lys 


Val 


Arg 






395 








400 


Ala 


Gin 


He Asp 


Pro 


Thr 


Pro 


Val 




410 








415 




Cys 


Ala 


Val He 


Asp 


Gly 


Asn 


Ser 


425 








430 






Leu 


Gin 


Asp Arg 


Met 


Phe 


Lys 


Phe 








445 

mj 








Asp 


Phe 


Gly Lys 


Val 


Thr 


Lys 


Gin 




460 










Ala 


Lys 




Val 


Val 


Glu 


Val 












480 


Gly Gly 


nia uy e» 


J— ! jr D 


Aircr 


Pro 


Ala 




490 








495 




Pro 


Lys 


ATy V d-L 








Val 


505 






510 






Glu 


Ala 


Del 11C 


7\ en 
rib 11 


Tvr 


Ala 


Asn 








525 








263 


GCC 












Asp 


Lys 


Gly Ile


Thr 


Ser 


Glu 


Lys 


10 








15 




Ser Tyr 


Ile Ser


Phe 


Asn 


Ala 


Ala 


25 








30 






Ala 


Ala 




Asn 


Ala 


Gly 


Lys 








45 








Pro 


Asp 


iyi jjc *jl 


Val 


Gly


Gln


Gln




60 










Arg 


He 


Tyr Lys 


He 


Leu 


Glu 


Leu 




75 








80 


Ala 


Ser 


Val Phe 


Leu 


Gly 


Trp 


Ala 




90 








95 




Thr 


He 


Trp Leu 


Phe 


Gly 


Pro 


Ala 


105 






110 






Glu 


Ala 


He Ala 


His 


Thr 


Val 


Pro 








125 








Asn 


Glu 


Asn Phe 


Pro 


Phe 


Asn 


Asp 






140 










Trp 


Glu 


Glu Gly 


Lys 


Met 


Thr 


Ala 




155 








160 


He 


Leu 


Gly Gly 


Ser 


Lys 


Val 


Arg 




170 






175 


Val 


Ala 


Gin 


He Asp 


Pro 


Thr 


Pro 


185 






190 






Cys 


Ala 


Val He 


Asp 


Gly 


Asn 


Ser 






205 








Leu 


Gin 


Asp Arg 


Met 


Phe 


Lys 


Phe 






220 
Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val

245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

290 295 300

Arg Leu Ala Arg Gly His Ser Leu
305 310



<210> 85 

<211> 621 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 264 GCG 



<400> 85 



Thr 


Ala 


Gly 


Phe 


Tyr 


Glu 


He 


Val 


He Lys Val Pro 


Ser Asp 


Leu Asp 


1 






5 








10 




15 


Glu 


His 


Leu 


Pro 


Gly 


He 


Ser 


Asp 


Ser Phe Val Asn 


Trp Val 


Ala Glu 








20 








25 


30 




Lys 


Glu 


Trp 


Glu 


Leu 


Pro 


Pro 


Asp 


Ser Asp Met Asp 


Leu Asn 


Leu lie 




35 










40 




45 




Glu 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 


Ala 


Glu Lys Leu Gin 


Arg Asp 


Phe Leu 




50 










55 




60 






Thr 


Glu 


Trp 


Arg 


Arg 


Val 


Ser 


Lys 


Ala Pro Glu Ala 


Leu Phe 


Phe Val 


65 






70 






75 




80 


Gin 


Phe 


Glu 


Lys 


Gly 


Glu 


Ser 


Tyr 


Phe His Met His 


Val Leu 


Val Glu 








85 








90 




95 


Thr 


Thr 


Gly 


Val 


Lys 


Ser 


Met 


Val 


Leu Gly Arg Phe 


Leu Ser 


Gin He 






100 








105 


110 




Arg 


Glu 


Lys 


Leu 


He 


Gin 


Arg 


He 


Tyr Arg Gly He 


Glu Pro 


Thr Leu 




115 










120 




125 




Pro 


Asn 


Trp 


Phe 


Ala 


Val 


Thr 


Lys 


Thr Arg Asn Gly 


Ala Gly Gly Gly 




130 








135 




140 






Asn 


Lys 


Val 


Val 


Asp 


Glu 


Cys 


Tyr 


He Pro Asn Tyr 


Leu Leu 


Pro Lys 


145 








150 






155 




160 


Thr 


Gin 


Pro 


Glu 


Leu 


Gin 


Trp 


Ala 


Trp Thr Asn Met 


Glu Gin Tyr Leu 










165 






170 




175 


Ser 


Ala 


Cys 


Leu 


Asn 


Leu 


Thr 


Glu 


Arg Lys Arg Leu 


Val Ala 


Gin His 






180 










185 


190 




Leu 


Thr 


His 


Val 


Ser 


Gin 


Thr 


Gin 


Glu Gin Asn Lys 


Glu Asn 


Gin Asn 






195 










200 




205 




Pro 


Asn 


Ser 


Asp 


Ala 


Pro 


Val 


He 


Arg Ser Lys Thr 


Ser Ala Arg Tyr 




210 








215 




220 






Met 


Glu 


Leu 


Val 


Gly 


Trp 


Leu 


Val 


Asp Lys Gly He 


Thr Ser 


Glu Lys 


225 








230 






235 




240 


Gin 


Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ala 


Ser Tyr He Ser 


Phe Asn 


Ala Ala 








245 






250 




255 


Ser 


Asn 


Ser 


Arg 


Ser 


Gin 


He 


Ala 


Ala Ala Leu Asp 


Asn Ala Gly Lys 








260 










265 


270 




He 


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 


Pro Asp Tyr Leu 


Val Gly Gin Gin 






275 










280 




285 




Pro 


Val 


Glu 


Asp 


He 


Ser 


Ser 


Asn 


Arg He Tyr Lys 


He Leu 


Glu Leu 




290 








295 




300 






Asn 


Gly 


Tyr 


Asp 


Pro 


Gin 


Tyr 


Ala 


Ala Ser Val Phe 


Leu Gly Trp Ala 
305










310 






Thr


Lys


Lys 


Phe Gly Lys 


Arg 


Asn 








325 








THt" 


Thr 


Gly Lys 


Thr 


Asn 


lie 


Ala 








340 










Phe 


Tyr


Gly Cys 


Val 


Asn 


Trp 


Thr 






355 










360 




V Ct A. 


Asp 


Lys 


Met 


Val 


Ile


Trp


370 










375 






V ct JL 


Val 


Glu 


Ser 


^V-L CI 


Lys


Ala 


J O J 










"3 90 






vai 


ASp 


Gin Lys 


Cys 


j-i_y o 


Cpr 
OCX 


Cpr 

<J w JL. 








405 








lie 


Val 


Thr 


Ser 


Asn 


JL 1XJL 


Asn 

run 


Met 








420 










inx 


inr 


Phe 


Glu 


His 




Gin 








435 










440 


y^t 1 ... 
ulu 


bcU 


Thr Arg Arg 


Leu 




His 




450 










455 




Glu 


Val 


Lys 


Asp 


Phe 


Phe 


Arg 


Trp 


465 








470 






Glu 


His 


Glu 


Phe 


Tyr 


Val 


Lys 


Lys 










485 








Pro 


Ser 


Asp 


Ala 


Asp 


lie 


Ser 


Glu 








500 










Ala 


Gin 


Pro 


Ser 


Thr 


Ser 


Asp 


Ala 






515 










520 


Arg 


Tyr 


Gin 


Asn 


Lys 


Cys 


Ser 


Arg 


530 










535 




Phe 


Pro 


Cys 


Arg 


Gin 


Cys 


Glu 


Arg 


545 










550 






Phe 


Thr 


His 


Gly Gin 


Lys 


Asp 


Cys 










565 








Ser 


Gin 


Pro 


Val 


Ser 


Val 


Val 


Lys 








580 










lie 


His 


His 


lie 


Met 


Gly 


Lys 


Val 






595 










600 


Leu 


Val 


Asn 


Val 


Asp 


Leu 


Asp 


Asp 




610 










615 





<210> 86 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 



<400> 86 



Met 


Glu 


Leu Val 


Gly 


Trp 


Leu 


Val 


1 






5 








Gin 


Trp 


He Gin 


Glu 


Asp 


Gin 


Ala 




20 










Ser 


Asn 


Ser Arg 
35 


Ser 


Gin 


He 


Ala 
40 


He 


Met 


Ser Leu 


Thr 


Lys 


Thr Ala 




50 








55 




Pro 


Val 


Glu Asp 


He 


Ser 


Ser 


Asn 


65 








70 






Asn Gly 


Tyr Asp 


Pro 


Gin 


Tyr 


Ala 



85 
315




320 


Thr 


He 


Trp Leu Phe Gly Pro 


Ala 




330 




335 




Glu 


Ala 


He 


Ala His Thr Val 


Pro 


345 






350 




Asn 


Glu 


Asn 


Phe Pro Phe Asn 


Asp 








365 




Trp 


Glu 


Glu Gly Lys Met Thr 


Ala 






380 




lie 


Leu 


Gly Gly Ser Lys Val 


Arg 






395 




400 


Ala 


Gin 


He 


Asp Pro Thr Pro 


Val 




410 




415 




Cys


Ala 


Val 


He Asp Gly Asn 


Ser 


425 






430 




Leu 


Gin 


Asp 


Arg Met Phe Lys 


Phe 








445 




Asp 


Phe 


Gly Lys Val Thr Lys 


Gin 






460 




Ala 


Lys 


Asp 


His Val Val Glu 


Val 




475 




480 


Gly 


Gly 


Ala 


Lys Lys Arg Pro 


Ala 




490 




495 




Pro 


Lys 


Arg 


Val Arg Glu Ser 


Val 


505 






510 




Glu 


Ala 


Ser 


He Asn Tyr Ala 


Asp 








525 




His 


Val 


Gly Met Asn Leu Met 


Leu 








540 




Met 


Asn 


Gin 


Asn Ser Asn He 


Cys 






555 




560 


Leu 


Glu 


Cys 


Phe Pro Val Ser 


Glu 




570 


575 




Lys 


Ala 


Tyr 


Gin Lys Leu Cys 


Tyr 


585 






590 




Pro 


Asp 


Ala 


Cys Thr Ala Cys 


Asp 






605 




Cys 


He 


Phe 


Glu Gin 





620 



264 


GCG 








Asp 


Lys 


Gly He 


Thr Ser Glu 


Lys 




10 




15 




Ser 


Tyr 


He Ser 


Phe Asn Ala 


Ala 


25 




30 




Ala 


Ala 


Leu Asp Asn Ala Gly 


Lys 








45 




Pro 


Asp 


Tyr Leu 


Val Gly Gin 


Gin 






60 






Arg 


He 


Tyr Lys 


He Leu Glu 


Leu 






75 




80 


Ala 


Ser 


Val Phe 


Leu Gly Trp 


Ala 




90 




95 
Thr


Lys


Lys 


Phe 








100 






Gly Lys 






115 




lr IXC. 


±yj~ 


Gly Cys 




130 








v ax 


Asp 


Lys 


145 








Lys 


Val 


Val 


Glu 


vai 


ASp 


Gin 


Lys 








180 


lie 


vai 


Thr 


Ser 






195 




Thr 




Phe 


Glu 










Glu 


Leu 


Thr Arg 


225 








Glu 


Val 


Lys 


Asp 


Glu 


His 


Glu 


Phe 








260 


Pro 


Ser 


Asp 


Ala 






275 




Ala 


Gin 


Pro 


Ser 




290 






Arg 


Tyr 


Gin 


Asn 


305 








Phe 


Pro 


Cys 


Arg 


Phe 


Thr 


His 


Gly 








340 


Ser 


Gin 


Pro 


Val 






355 




lie 


His 


His 


He 




370 






Leu 


Val 


Asn 


Val 


385 








Gly Lys Arg Asn

Thr Asn Ile Ala

120

Val Asn Trp Thr
135

Met Val Ile Trp
150

Ser Ala Lys Ala
165

Cys Lys Ser Ser

Asn Thr Asn Met

200

His Gln Gln Pro
215

Arg Leu Asp His
230

Phe Phe Arg Trp
245

Tyr Val Lys Lys

Asp Ile Ser Glu

280

Thr Ser Asp Ala
295

Lys Cys Ser Arg
310

Gln Cys Glu Arg
325

Gln Lys Asp Cys

Ser Val Val Lys

360

Met Gly Lys Val
375

Asp Leu Asp Asp
390



Thr Ile Trp Leu
105

Glu Ala Ile Ala

Asn Glu Asn Phe

140

Trp Glu Glu Gly
155

Ile Leu Gly Gly
170

Ala Gln Ile Asp
185

Cys Ala Val Ile

Leu Gln Asp Arg

220

Asp Phe Gly Lys
235

Ala Lys Asp His
250

Gly Gly Ala Lys
265

Pro Lys Arg Val

Glu Ala Ser Ile

300

His Val Gly Met
315

Met Asn Gln Asn
330

Leu Glu Cys Phe
345

Lys Ala Tyr Gln

Pro Asp Ala Cys

380

Cys Ile Phe Glu
395



Phe Gly Pro Ala
110

His Thr Val Pro
125

Pro Phe Asn Asp

Lys Met Thr Ala

160

Ser Lys Val Arg
175

Pro Thr Pro Val
190

Asp Gly Asn Ser
205

Met Phe Lys Phe

Val Thr Lys Gln

240

Val Val Glu Val
255

Lys Arg Pro Ala
270

Arg Glu Ser Val
285

Asn Tyr Ala Asp

Asn Leu Met Leu

320

Ser Asn Ile Cys
335

Pro Val Ser Glu
350

Lys Leu Cys Tyr
365

Thr Ala Cys Asp
Gln



<210> 87
<211> 536
<212> PRT

<213> Artificial Sequence



<220> 

<223> Mutant rep protein: rep68 264 GCG 



<400> 87 












Thr 


Ala 


Gly 


Phe 


Tyr 
5 


Glu 


He 


Val 


1 
Glu 


His 


Leu 


Pro 


Gly 


He 


Ser 


Asp 








20 










Lys 


Glu 


Trp 


Glu 


Leu 


Pro 


Pro 


Asp 




35 










40 


Glu 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 


Ala 




50 










55 




Thr 


Glu 


Trp 


Arg 


Arg 


Val 


Ser 


Lys 


65 










70 






Gin 


Phe 


Glu 


Lys 


Gly 


Glu 


Ser 


Tyr 










85 








Thr 


Thr 


Gly 


Val 


Lys 


Ser 


Met 


Val 



Ile Lys Val Pro Ser Asp Leu Asp

10 15
Ser Phe Val Asn Trp Val Ala Glu
25 30
Ser Asp Met Asp Leu Asn Leu Ile

45

Glu Lys Leu Gln Arg Asp Phe Leu

60

Ala Pro Glu Ala Leu Phe Phe Val

75 80
Phe His Met His Val Leu Val Glu

90 95
Leu Gly Arg Phe Leu Ser Gln Ile
100










Arg 


Glu 


Lys 


Leu 


He 


Gin Arg 


He 




115 










120 

-i- yj 


Pro 


Asn 


Trp 


Phe 


Ala 


V d -L 


Thr 

X 111. 


j-j y s> 




1.3 U 








135 

-X- ~J 




Asn 


Lys 


Val 


Val 


Asp 


VjXU 


<jys 


Tvr 


145 














Thr 


Gin 


Pro 


Glu 


Leu 
165 


Gin 


Trp 


&J.3. 


Ser 


Ala 


Cys 


Leu 
180 


Asn 


Leu 


inr 


LrlU 


Leu 


Thr 


His 
195 


Val 


Ser 


Gin 


inr 


Gin 


Pro 


Asn 


Ser 


Asp 


Ala 


Pro 


Val 


Tl — 

lie 




210 








215 




Met 


mm. T 

Glu 


Leu Val Gly 


Trp 


Leu 


vai 


225 










230 






Gin 


Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ala 








245 








Ser 


Asn 


Ser 


Arg 
260 


Ser 


Gin 


He 


Ala 


lie 


Met 


Ser 
275 


Leu 


Thr 


Lys 


Thr 


Aia 

2 o U 


Pro 


Val 


Glu Asp 


He 


Ser 


Ser 


Asn 




290 










295 




Asn 


Gly 


Tyr Asp 


Pro 


Gin 


Tyr 


Aid 


305 










310 






Thr 


Lys 


Lys 


Phe Gly 


Lys 


Arg 


Asn 








325 








Thr 


■ml 

Thr 


Gly Lys 


Thr 


Asn 


He 


Aid 








340 










Phe 


Tyr 


Gly 


Cys 


Val 


Asn 


Trp 


inr 




355 










•J V w 


Cys 


Val 


Asp 


Lys 


Met 


Val 


He 




370 










375 




Lys 


Val 


Val 


Glu 


Ser 


Ala Lys 


Ala 


385 










390 






Val 


Asp 


Gin 


Lys 


Cys 
405 


Lys 


Ser 


Ser 


lie 


Val 


Thr 


Ser 
420 


Asn 


Thr 


Asn 


Met 


Thr 


Thr 


Phe 
435 


Glu 


His 


Gin 


Gin 


Pro 
440 


Glu 


Leu 


Thr Arg Arg 


Leu Asp 


His 




450 










455 




Glu 


Val 


Lys 


Asp 


Phe 


Phe 


Arg 


Trp 


465 










470 






Glu 


His 


Glu 


Phe 


Tyr 
485 


Val 


Lys 


Lys 


Pro 


Ser 


Asp Ala Asp 


He 


Ser 


Glu 








500 










Ala 


Gin 


Pro 


Ser 


Thr 


Ser Asp 


Ala 






515 










520 


Arg 


Leu 


Ala Arg Gly 


His 


Ser 


Leu 


530 










535 





<210> 88 
<211> 312 
<212> PRT 

<213> Artificial Sequence 



105 










110 






Tyr Arg Gly 


He 


Glu 


Pro 


Thr 


Leu 










125 








Thr Arg Asn 


Gly Ala Gly 


Gly 


Gly 








140 










He 


Pro 


^VO 11 


Tvr 


Leu 


Leu 

-X-J M 


Pro 


Lys 






155 








160 


Trp 


Thr 


7\ c? 






VJ 111 


Tyr 


Leu 


170 










175 




Arg Lys 




xjc u. 


Val 


Ala 


Gin 


His 


185 










190 






Glu 


Gin 


Asn 


T .Tro 

j-jy t> 




21 cjtj 
riijli 


Gin 


Asn 








\J ~> 








Arg 


Ser 


Lys 


Xli-L 


OCX. 


Ala 


Aro 


Tyr 

-i 






220 










Asp 


Lys 


Gly 


He 


Thr 


Ser 


Glu 


Lvs 


235 










240 


Ser, 


Tyr 


He 


Ser 


Phe 


Asn 


Ala 


Ala 




250 










255 




Ala 


Ala 


Leu 


Asp 


Asn 


Ala 


Glv 


Lys 


265 








270 






Pro Asp 


Tyr 


Leu 


Val 


Gly 


Gin 


Gin 










285 








Arg 


He 


Tyr 


Lys 


He 


Leu 


Glu 


Leu 






300 










Ala 


Ser 


Val 
315 


Phe 


Leu 


Gly 


Trp 


Ala 
320 


Thr 


He 


Trp 


Leu 


Phe 


Gly 


Pro 


Ala 




330 








335 




Glu 


Ala 


He 


Ala 


His 


Thr 


Val 


Pro 


345 










350 






Asn 


Glu 


Asn 


Phe 


Pro 
365 


Phe 


Asn 


Asp 


Trp 


Glu 


Glu 


Gly 


Lys 


Met 


Thr 


Ala 






380 










He 


Leu Gly 


Gly 


Ser 


Lys 


Val 


Arg 






395 










400 


Ala 


Gin 


He 


Asp 


Pro 


Thr 


Pro 


Val 




410 








415 




Cys 


Ala 


Val 


He Asp Gly 


Asn 


Ser 


425 










430 






Leu Gin Asp 


Arg 


Met 


Phe 


Lys 


Phe 










445 








Asp 


Phe Gly 


Lys 


Val 


Thr 


Lys 


Gin 






460 










Ala 


Lys 


Asp 


His 


Val 


Val 


Glu 


Val 




475 










480 


Gly Gly Ala 


Lys 


Lys 


Arg 


Pro 


Ala 




490 










495 


Val 


Pro Lys Arg 


Val 


Arg 


Glu 


Ser 


505 










510 






Glu 


Ala 


Ser 


He 


Asn Tyr 


Ala 


Asp 










525 
<220>

<223> Mutant rep protein: rep40 264 GCG 

<400> 88
Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys

1 5 10 15 

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

20 25 30

Ser Asn Ser Arg Ser Gln Ile Ala Ala Ala Leu Asp Asn Ala Gly Lys

35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala

85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp

130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg

165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val

245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

290 295 300 

Arg Leu Ala Arg Gly His Ser Leu 
305 310 

<210> 89 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 334 GCG 

<400> 89
Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp

1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu

20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile

35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50




X. XX Jw 


Glu 


Tro ArQ 


65 






Gin 


Phe 


Glu Lys 




XXLX 


Glv Val 

vjxy v ax 






100 

JL> V/ v 


ATy 








115 
-i — * — ^ 




ash 


XXL/ XrxxG 




1 **o 

J- _> VJ 




Asn 


Lys 


VaJL Val 


145 






Thr 


Gin 


Pro Glu 


Ser 


Ala 


Lrys xieu 






■L o V 




inr 


nis Val 






1 Q5 


Pro 


Asn 


ber Asp 




Z±KJ 




Met 


vjJ.U 


lieu val 


225 






Gin 


Trp 


He Gin 


bear 


ASH 


ber Arg 








lie 


MeC 


oer Jjcu 










V o.X 


U1U Ask/ 




290 




7V 




X^fT ASp 


305 






Thr 


Lys 


Lys Phe 


T* Vi *v* 












DVi<» 
xrllfc; 


Tut 

xyr 








355 - 


cys 


vai 


Asp iiys 




370 




Lys 


Val 


Val Glu 


385 






Val 


Asp 


Gin Lys 


He 


Val 


Thr Ser 






420 


Thr 


Thr 


Phe Glu 






435 


Glu 


Leu 


Thr Arg 




450 




Glu 


Val 


Lys Asp 


465 






Glu 


His 


Glu Phe 


Pro 


Ser 


Asp Ala 






500 


Ala 


Gin 


Pro Ser 






515 


Arg 


Tyr 


Gin Asn 


530 





55 

Arg Val Ser Lys 
70 

Gly Glu Ser Tyr 
85 

Lys Ser Met Val 

Ile Gln Arg Ile

120

Ala Val Thr Lys
135

Asp Glu Cys Tyr
150

Leu Gln Trp Ala
165

Asn Leu Thr Glu

Ser Gln Thr Gln

200

Ala Pro Val Ile
215

Gly Trp Leu Val
230

Glu Asp Gln Ala
245

Ser Gln Ile Lys

Thr Lys Thr Ala

280

Ile Ser Ser Asn
295

Pro Gln Tyr Ala
310

Gly Lys Arg Asn
325

Thr Asn Ile Ala

Val Asn Trp Thr

360

Met Val Ile Trp
375

Ser Ala Lys Ala
390

Cys Lys Ser Ser
405

Asn Thr Asn Met

His Gln Gln Pro

440

Arg Leu Asp His
455

Phe Phe Arg Trp
470

Tyr Val Lys Lys
485

Asp Ile Ser Glu

Thr Ser Asp Ala 

520 

Lys Cys Ser Arg 
535 









60 


Ala 


Pro 


Glu 


Ala 






75 




Phe 


His 


Met 


His 




90 






Leu Gly 


Arg 


Phe 


105 








Tyr Arg 


Gly 


lie 


Thr Arg 


Asn 


Gly








140 


He 


Pro 


Asn 


Tyr






155 




Trp 


Thr 


Asn 


Met 




170 






Arg 


Lys 


Arg


Leu 


185 








Glu 


Gin 


Asn 


Lys 


Arg 


Ser 


Lys


Thr 








220 


Asp 


Lys 


Glv 


He 






235 




Ser 


Tyr 


He 


Ser 




250 






Ala 


Ala 


Leu 


Asp 


265 








Pro Asp 


Tyr 


Leu 


Arg 


He 


Tyr 


Lys 








300 


Ala 


Ser 


Val 


Phe 






315 




Thr 


He 


Trp 


Leu 




330 






Glu 


Ala 


He 


Ala 


345 








Asn 


Glu 


Asn 


Phe 


Trp 


Glu 


Glu 


Glv 








380 


He 


Leu 


Gly 


Gly 






395 




Ala 


Gin 


He 


Asp 




410 






Cys 


Ala 


Val 


He 


425 








Leu 


Gin 


Asp 


Arg 


Asp 


Phe 


Gly 


Lys 








460 


Ala Lys 


Asp 


His 






475 




Gly Gly 


Ala 


Lys 




490 






Pro 


Lys 


Arg 


Val 


505 








Glu 


Ala 


Ser 


He 


His 


Val 


Gly 


Met 






540 



Leu Phe Phe Val 

80 

Val Leu Val Glu 
95 

Leu Ser Gln Ile
110

Glu Pro Thr Leu
125

Ala Gly Gly Gly

Leu Leu Pro Lys

160

Glu Gln Tyr Leu
175

Val Ala Gln His
190

Glu Asn Gln Asn
205

Ser Ala Arg Tyr

Thr Ser Glu Lys

240

Phe Asn Ala Ala
255

Asn Ala Gly Lys
270

Val Gly Gln Gln
285

Ile Leu Glu Leu

Leu Gly Trp Ala

320

Phe Ala Pro Ala
335

His Thr Val Pro
350

Pro Phe Asn Asp
365

Lys Met Thr Ala

Ser Lys Val Arg

400

Pro Thr Pro Val
415

Asp Gly Asn Ser
430

Met Phe Lys Phe
445

Val Thr Lys Gln

Val Val Glu Val 

480 

Lys Arg Pro Ala 
495 

Arg Glu Ser Val 
510 

Asn Tyr Ala Asp 
525 

Asn Leu Met Leu 
Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560
Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu

565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr

580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605
Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620

<210> 90
<211> 397
<212> PRT

<213> Artificial Sequence



<220> 

<223> Mutant rep protein: rep52 334 GCG 



<400> 90 



Met 


Glu 


Leu 


Val 


Gly 
5 


Trp 


Leu 


Val 


1 
Gin 


Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ala 






20 










Ser 


Asn 


Ser 


Arg 


Ser 


Gin 


He 


Lys 






35 








40 


He 


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 




50 










55 




Pro 


Val 


Glu 


Asp 


He 


Ser 


Ser 


Asn 


65 










70 






Asn 


Gly 


Tyr 


Asp 


Pro 


Gin 


Tyr 


Ala 










85 








Thr 


Lys 


Lys 


Phe 


Gly 


Lys 


Arg 


Asn 








100 










Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 






115 








120 


Phe 


Tyr 


Gly 


Cys 


Val 


Asn 


Trp 


Thr 




130 










135 




Cys 


Val 


Asp 


Lys 


Met 


Val 


He 


Trp 


145 










150 






Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 








165 








Val 


Asp 


Gin 


Lys 


Cys 


Lys 


Ser 


Ser 








180 










He 


Val 


Thr 


Ser 


Asn 


Thr 


Asn 


Met 






195 










200 


Thr 


Thr 


Phe 


Glu 


His 


Gin 


Gin 


Pro 




210 










215 


His 


Glu 


Leu 


Thr 


Arg 


Arg 


Leu 


Asp 


225 










230 






Glu 


Val 


Lys 


Asp 


Phe 


Phe 


Arg 


Trp 










245 








Glu 


His 


Glu 


Phe 


Tyr 


Val 


Lys 


Lys 








260 










Pro 


Ser 


Asp 


Ala 


Asp 


He 


Ser 


Glu 






275 










280 


Ala 


Gin 


Pro 


Ser 


Thr 


Ser 


Asp 


Ala 




290 










295 




Arg 


Tyr 


Gin 


Asn 


Lys 


Cys 


Ser 


Arg 


305 










310 






Phe 


Pro 


Cys 


Arg 


Gin 


Cys 


Glu 


Arg 



Asp Lys 


Gly He 


Thr 


Ser 


Glu 


Lys 


10 








15 




Ser Tyr 


He Ser 


Phe 


Asn 


Ala 


Ala 


25 






30 






Ala Ala 


Leu Asp 


Asn Ala Gly 
45 


Lys 


Pro Asp 


Tyr Leu 


Val Gly Gin 


Gin 




60 










Arg He 


Tyr Lys 


He 


Leu 


Glu 


Leu 


75 








80 


Ala Ser 


Val Phe 


Leu Gly Trp 


Ala 


90 








95 




Thr He 


Trp Leu 


Phe 


Ala 


Pro 


Ala 


105 




110 






Glu Ala 


He Ala 


His 


Thr 


Val 


Pro 






125 








Asn Glu 


Asn Phe 


Pro 


Phe 


Asn 


Asp 




140 










Trp Glu Glu Gly 


Lys 


Met 


Thr 


Ala 




155 








160 


He Leu Gly Gly 


Ser 


Lys 


Val 


Arg 


170 








175 




Ala Gin 


He Asp 


Pro 


Thr 


Pro 


Val 


185 






190 






Cys Ala Val He 


Asp 


Gly Asn 


Ser 






205 








Leu Gin Asp Arg 


Met 


Phe 


Lys 


Phe 




220 










Asp Phe Gly Lys 


Val 


Thr 


Lys 


Gin 




235 








240 


Ala Lys Asp His 


Val 


Val 


Glu 


Val 


250 








255 




Gly Gly Ala Lys 


Lys 


Arg 


Pro 


Ala 


265 






270 






Pro Lys Arg Val 


Arg 


Glu 


Ser 


Val 






285 








Glu Ala 


Ser He 


Asn Tyr 


Ala 


Asp 




300 










His Val 


Gly Met 


Asn 


Leu 


Met 


Leu 




315 








320 


Met Asn 


Gin Asn 


Ser 


Asn 


He 


Cys 
325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu

340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr

355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp

370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln

385 390 395



<210> 91 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 334 GCG 



<400> 91 



Thr 


Ala 


Gly 


Phe


1 








Glu 


His 


Leu 


Pro 








20 


Lys 


Glu 


Trp 


Glu 






35 




Glu 


Gin 


Ala 


Pro 




50 






Thr 


Glu 


Trp 


Arg 


65 








Gin 


Phe 


Glu 


Lys 


Thr 


Thr 


Gly 


Val 








100 


Arg 


Glu 


Lys 


Leu 






115 




Pro 


Asn 


Trp 


Phe 




130 






Asn 


Lys 


Val 


Val 


145 








Thr 


Gin 


Pro 


Glu 


Ser 


Ala 


Cys 


Leu 








180 


Leu 


Thr 


His 


Val 






195 




Pro 


Asn 


Ser 


Asp 




210 






Met 


Glu 


Leu 


Val 


225 








Gin 


Trp 


He 


Gin 


Ser 


Asn 


Ser 


Arg 








260 


lie 


Met 


Ser 


Leu 






275 




Pro 


Val 


Glu 


Asp 




290 






Asn 


Gly 


Tyr 


Asp 


305 








Thr 


Lys 


Lys 


Phe 



Tyr 


v?JLU 


± j.e 


V d± 


5 








Gly 


He 


Ser 


Asp 


Leu 


Pro 


Pro 


Asp 








4a 


Leu 


Thr 


Val 


Ala 






55 




Arg 


Val 


Ser 


Lys 




70 






Gly 


Glu 


Ser 


Tyr 


85 








Lys 


Ser 


Met 


Val 


He 


Gin 


Arg 


He 








120 


Ala 


Val 


Thr 


Lys 






135 




Asp 


Glu 


Cys 


Tyr 




150 






Leu 


Gin 


Trp 


Ala 


165 








Asn 


Leu 


Thr 


Glu 


Ser 


Gin 


Thr 


Gin 








200 


Ala 


Pro 


Val 


He 






215 




Gly 


Trp 


Leu 


Val 




230 






Glu 


Asp 


Gin 


Ala 


245 








Ser 


Gin 


He 


Lys 


Thr 


Lys 


Thr 


Ala 






280 


He 


Ser 


Ser 


Asn 






295 




Pro 


Gin 


Tyr 


Ala 




310 






Gly 


Lys 


Arg 


Asn 



325 



Ile Lys Val Pro
10

Ser Phe Val Asn
25

Ser Asp Met Asp

Glu Lys Leu Gln

60

Ala Pro Glu Ala
75

Phe His Met His
90

Leu Gly Arg Phe
105

Tyr Arg Gly Ile

Thr Arg Asn Gly

140

Ile Pro Asn Tyr
155

Trp Thr Asn Met
170

Arg Lys Arg Leu
185

Glu Gln Asn Lys

Arg Ser Lys Thr

220

Asp Lys Gly Ile
235

Ser Tyr Ile Ser
250

Ala Ala Leu Asp
265

Pro Asp Tyr Leu

Arg Ile Tyr Lys

300

Ala Ser Val Phe
315

Thr Ile Trp Leu
330



Ser Asp Leu Asp
15

Trp Val Ala Glu
30

Leu Asn Leu Ile
45

Arg Asp Phe Leu

Leu Phe Phe Val

80

Val Leu Val Glu

95

Leu Ser Gln Ile
110

Glu Pro Thr Leu
125

Ala Gly Gly Gly

Leu Leu Pro Lys

160

Glu Gln Tyr Leu
175

Val Ala Gln His
190

Glu Asn Gln Asn
205

Ser Ala Arg Tyr

Thr Ser Glu Lys

240

Phe Asn Ala Ala
255

Asn Ala Gly Lys
270

Val Gly Gln Gln
285

Ile Leu Glu Leu

Leu Gly Trp Ala 

320 

Phe Ala Pro Ala 
335 
Thr


Thr 


Gly Lys 


Thr 


Asn 


He 


Ala 


Glu 






340 










345 


Phe 


Tyr 


Gly Cys 


Val 


Asn 


Trp 


Thr 


Asn 




355 








360 




Cys


Val 


Asp Lys 


Met 


Val 


He 


Trp 


Trp 


370 








375 






Lys 


Val 


Val Glu 


Ser 


Ala 


Lys 


Ala 


He 


385 








390 








Val 


Asp 


Gin Lys 


Cys 


Lys 


Ser 


Ser 


Ala 






405 










lie 


Val 


Thr Ser 


Asn 


Thr 


Asn 


Met 


Cys 






420 










425 


Thr 


Thr 


Phe Glu 


His 


Gin 


Gin 


Pro 


Leu 






435 








440 




Glu 


Leu 


Thr Arg Arg 


Leu Asp 


His 


Asp 




450 








455 






Glu 


Val 


Lys Asp 


Phe 


Phe 


Arg Trp 


Ala 


465 








470 








Glu 


His 


Glu Phe 


Tyr 


Val 


Lys 


Lys 


Gly 








485 










Pro 


Ser 


Asp Ala Asp 


He 


Ser 


Glu 


Pro 






500 










505 


Ala 


Gin 


Pro Ser 


Thr 


Ser Asp 


Ala 


Glu 






515 








520 




Arg 


Leu 


Ala Arg Gly His 


Ser 


Leu 




530 








535 







<210> 92 
<211> 312 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 334 



<400> 92 



Met 


Glu 


Leu 


Val 


Gly 
5 


Trp 


Leu 


Val 


Asp 


1 
Gin 


Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ala 


Ser 






20 










25 


Ser 


Asn 


Ser 


Arg 


Ser 


Gin 


He 


Lys 


Ala 






35 








40 




He 


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 


Pro 




50 










55 






Pro 


Val 


Glu 


Asp 


He 


Ser 


Ser 


Asn 


Arg 


65 










70 








Asn 


Gly 


Tyr 


Asp 


Pro 


Gin 


Tyr 


Ala 


Ala 










85 










Thr 


Lys 


Lys 


Phe 


Gly 


Lys 


Arg 


Asn 


Thr 








100 










105 


Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 


Glu 






115 










120 




Phe 


Tyr 


Gly 


Cys 


Val 


Asn 


Trp 


Thr 


Asn 




130 










135 






Cys 


Val 


Asp 


Lys 


Met 


Val 


He 


Trp 


Trp 


145 










150 








Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 


He 








165 










Val 


Asp 


Gin 


Lys 


Cys 


Lys 


Ser 


Ser 


Ala 








180 










185 


He 


Val 


Thr 


Ser 


Asn 


Thr 


Asn 


Met 


Cys 
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Ala 


He 


Ala 


His 


Thr 


Val 


Pro 










350 






Glu 


Asn 


Phe 


Pro 


Phe 


Asn 


Asp 








365 








Glu 


Glu 


Gly Lys 


Met 


Thr 


Ala 






380 










Leu 


Gly Gly Ser 


Lys 


Val 


Arg 




395 










400 


Gin 


He Asp 


Pro 


Thr 


Pro 


Val 


410 










415 




Ala 


Val 


He 


Asp 


Gly Asn 


Ser 










430 






Gin 


Asp Arg 


Met 


Phe 


Lys 


Phe 








445 








Phe 


Gly Lys Val 


Thr 


Lys 


Gin 






460 










Lys


Asp 


His 


Val 


Val 


Glu 


Val 




475 










480 


Gly

V»J J 


Ala 


Lys 


Lys 


Arg 


Pro 


Ala 


490 










495 




Lys


Arg 


Val 


Arg 


Glu 


Ser 


Val 










510 






Ala 


Ser 


He 


Asn 


Tyr Ala Asp 








525 








GCG 














Lys 


Gly 


He 


Thr 


Ser 


Glu 


Lys 


10 










15 




Tyr 


He 


Ser 


Phe 


Asn 


Ala 


Ala 








30 






Ala 


Leu Asp 


Asn 


Ala 


Gly Lys 








45 








Asp 


Tyr 


Leu 


Val 


Gly Gin Gin 






60 










He 


Tyr 


Lys 


He 


Leu 


Glu 


Leu 




75 










80 


Ser 


Val 


Phe 


Leu 


Gly Trp Ala 


90 










95 




He 


Trp 


Leu 


Phe 


Ala 


Pro 


Ala 








110 






Ala 


He 


Ala 


His 


Thr 


Val 


Pro 








125 








Glu 


Asn 


Phe 


Pro 


Phe 


Asn Asp 






140 










Glu 


Glu Gly Lys 


Met 


Thr 


Ala 




155 










160 


Leu 


Gly Gly Ser 


Lys 


Val 


Arg 


170 










175 




Gin 


He 


Asp 


Pro 


Thr 


Pro 


Val 










190 






Ala 


Val 


He Asp 


Gly Asn Ser 
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195 










200 




Thr Thr 


Phe 


Glu 


His 


Gin 


Gin 


Pro 


Leu 


210 










215 






Glu Leu 


Thr Arg Arg 


Leu 


Asp His 




225 








230 








Glu Val 


Lys 


Asp 


Phe 


Phe 


Arg 


Trp 


Ala 








245 










Glu His 


Glu 


Phe 


Tyr 


Val 


Lys 


Lys 


Gly 






260 








o ez c 


Pro Ser 


Asp Ala Asp 


He 


Ser 


Glu 


Pro 




275 










280 




Ala Gin 


Pro 


Ser 


Thr 


Ser 


Asp Ala 


Glu 


290 










295 






Arg Leu 


Ala Arg 


Gly 


His 


Ser 


Leu 




305 








310 








<210> 93 














<211> 621 














<212> PRT 














<213> Artificial Sequence 






<220> 
















<223> Mutant rep protein: rep78 


*3 xz 
335 


<400> 93 














Thr Ala 


Gly 


Phe 


Tyr 
5 


Glu 


He 


Val 


Ile


1 

Glu His 


Leu 


Pro 


Gly 


He 


Ser 


Asp 


Ser 






20 










25 


Lys Glu 


Trp 


Glu 


Leu 


Pro 


Pro 


Asp 


Ser 


35 










40 




Glu Gin 


Ala 


Pro 


Leu 


Thr 


Val 


Ala 


Glu


50










55 






Thr Glu 


Trp 


Arg 


Arg 


Val 


Ser 


Lys 


Ala 


65 








70 








Gin Phe 


Glu 


Lys 


Gly 


Glu 


Ser 


Tyr 


Phe








85 










Thr Thr 


Gly 


Val 


Lys 


Ser 


Met 


Val 


Leu 




100 












Arg Glu 


Lys 


Leu 


He 


Gin 


Arg 


He 


Tyr 


115 










120 




Pro Asn 


Trp 


Phe 


Ala 


Val 


Thr 


Lys 


Thr 


130 








135 






Asn Lys 


Val 


Val 


Asp 


Glu 


Cys 


Tyr 


He 


145 








150 








Thr Gin 


Pro 


Glu 


Leu 


Gin 


Trp 


Ala 


Trp 








165 










Ser Ala 


Cys 


Leu 


Asn 


Leu 


Thr 


Glu 


Arg 




180 










185 


Leu Thr 


His 


Val 


Ser 


Gin 


Thr 


Gin 


Glu 




195 










200 




Pro Asn 


Ser 


Asp 


Ala 


Pro 


Val 


He 


Arg 


210 








215 






Met Glu 


Leu 


Val 


Gly 


Trp 


Leu 


Val 


Asp 


225 








230 








Gin Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ala 


Ser 






245 










Ser Asn 


Ser 


Arg 


Ser 


Gin 


He 


Lys 


Ala 






260 










265 


He Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 


Pro 




275 










280 
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205 



Gin 


Asn 
no 


Arg Met 


Phe 


Lvs 


Phe 






£t V* 












Glv 


JjyS 


Val 


Thr 


Lvs 


Gin 




-> zj 










Zi *± \J 


T i* c 
Xiy S 


ASp 


xl 1 S 


val 


val 


Gill 


Val 

V ctl 


OCA 










/ jj 




vjly 


Aid 


Lys 


jjys 


niy 




Ala 

nla 










*>*7 rt 

43 / U 






Jjyb 




Val 


Arg 


ox u. 




Val 

v ax 








285 








Ala 


Ser 


He 


Asn 


iyr 


Aid 


Asp 






300 










GCT 














Lys 


va j. 


Pro 


Ser 


nop 


T oil 

Xjtr U. 




10 










Id 




pne 


vai 


Asn 


Trp 


Veil 


>Vlcl 


olU 








3U 








1*1 ti L. 


Asp 


Leu 


Aqti 


T»£»il 
uc u 


Tie 

1 -I- w 








45 








Lys 


lieu 


Gin Arg 




Phe 








60 










riO 




Ala 


Leu 






Val 

V CI ^ 














80 


111 S 




His 


Val 


1IC u 


Val 


Glu 

W JL W 


on 














Gly 


Arg 


Phe 


Leu 


Ser 


Gin 


He 








110 






Arg 


Gly 


He 


Glu 


Pro 


Thr 


Leu 




125 








Arg 


Asn 


Gly Ala 


Gly 


Gly 


Gly 






140 










Pro 


Asn 


Tyr 


Leu 


Leu 


Pro 


Lys 




155 








160 


Thr 


Asn 


Met 


Glu 


Gin 


Tyr 


Leu 


170 










175 




Lys 


Arg 


Leu 


Val 


Ala 


Gin 


His 








190 






Gin 


Asn 


Lys 


Glu 


Asn 


Gin 


Asn 








205 








Ser 


Lys 


Thr 


Ser 


Ala 


Arg 


Tyr 






220 










Lys 


Gly 


He 


Thr 


Ser 


Glu 


Lys 


235 










240 


Tyr 


He 


Ser 


Phe 


Asn 


Ala 


Ala 


250 










255 




Ala 


Leu 


Asp Asn 


Ala 


Gly 


Lys 










270 






Asp 


Tyr 


Leu 


Val 


Gly 


Gin 


Gin 



285 
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Pro Val Glu Asp 
290 

Asn Gly Tyr Asp 
305 

Thr Lys Lys Phe 

Thr Thr Gly Lys 

340 

Phe Tyr Gly Cys 
355 

Cys Val Asp Lys 
370 

Lys Val Val Glu 
385 

Val Asp Gln Lys

Ile Val Thr Ser

420 

Thr Thr Phe Glu 
435 

Glu Leu Thr Arg 
450 

Glu Val Lys Asp 
465 

Glu His Glu Phe 

Pro Ser Asp Ala 

500 

Ala Gln Pro Ser
515

Arg Tyr Gln Asn
530 

Phe Pro Cys Arg 
545 

Phe Thr His Gly 

Ser Gln Pro Val

580 

Ile His His Ile
595 

Leu Val Asn Val 
610 



Ile Ser Ser Asn
295 

Pro Gln Tyr Ala
310 

Gly Lys Arg Asn 
325 

Thr Asn Ile Ala

Val Asn Trp Thr 

360 

Met Val Ile Trp
375 

Ser Ala Lys Ala 
390 

Cys Lys Ser Ser 
405 

Asn Thr Asn Met 

His Gln Gln Pro

440 

Arg Leu Asp His 
455 

Phe Phe Arg Trp 
470 

Tyr Val Lys Lys 
485 

Asp Ile Ser Glu

Thr Ser Asp Ala 

520 

Lys Cys Ser Arg 
535 

Gln Cys Glu Arg
550

Gln Lys Asp Cys
565 

Ser Val Val Lys 

Met Gly Lys Val 

600 

Asp Leu Asp Asp 
615 



Arg Ile Tyr Lys

300 

Ala Ser Val Phe 
315 

Thr Ile Trp Leu
330 

Glu Ala Ile Ala
345 

Asn Glu Asn Phe 

Trp Glu Glu Gly 

380 

Ile Leu Gly Gly
395

Ala Gln Ile Asp
410

Cys Ala Val Ile
425

Leu Gln Asp Arg

Asp Phe Gly Lys 

460 

Ala Lys Asp His 
475 

Gly Gly Ala Lys 
490 

Pro Lys Arg Val
505

Glu Ala Ser Ile

His Val Gly Met 

540 

Met Asn Gln Asn
555 

Leu Glu Cys Phe 
570 

Lys Ala Tyr Gln
585 

Pro Asp Ala Cys 

Cys Ile Phe Glu

620 



Ile Leu Glu Leu

Leu Gly Trp Ala 

320 

Phe Gly Ala Ala 
335 

His Thr Val Pro 
350 

Pro Phe Asn Asp 
365 

Lys Met Thr Ala 

Ser Lys Val Arg 

400 

Pro Thr Pro Val 
415 

Asp Gly Asn Ser
430 

Met Phe Lys Phe 
445 

Val Thr Lys Gln

Val Val Glu Val 

480 

Lys Arg Pro Ala 
495 

Arg Glu Ser Val 
510 

Asn Tyr Ala Asp 
525 

Asn Leu Met Leu 

Ser Asn Ile Cys

560 

Pro Val Ser Glu 
575 

Lys Leu Cys Tyr 
590 

Thr Ala Cys Asp 

605 

Gln



<210> 94 
<211> 397 
<212> PRT 
<213> Artificial Sequence



<220> 

<223> Mutant rep protein: rep52 335 GCT 



<400> 94 

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys

1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
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65 








Asn 


Gly 


Tyr Asp


Thr 


Lys 


Lys 


Phe 








100 


Thr 


Thr 


Gly Lys 






115 




Phe 


Tyr 


Gly 


Cys 




130 






Cys 


Val 


Asp 


Lys 










Lys 


Val 


Val 


Glu 


Val 


Asp 


Gin 


Lys 








180 


He 


Val 


Thr 


Ser 






195 




Thr 


Thr 


Phe 


Glu 




210 






Glu 


Leu 


Thr Arg 


225 








Glu 


Val 


Lys 


Asp 


Glu 


His 


Glu 


Phe 








260 


Pro 


Ser 


Asp 


Ala 






275 




Ala 


Gin 


Pro 


Ser 




290 






Arg 


Tyr 


Gin 


Asn 


305 








Phe 


Pro 


Cys 


Arg 


Phe 


Thr 


His 


Gly 








340 


Ser 


Gin 


Pro 


Val 






355 




He 


His 


His 


He 




370 






Leu 


Val 


Asn 


Val 


385 









70 

Pro Gln Tyr Ala
85 

Gly Lys Arg Asn 

Thr Asn Ile Ala

120 

Val Asn Trp Thr 
135 

Met Val Ile Trp
150 

Ser Ala Lys Ala 
165 

Cys Lys Ser Ser 

Asn Thr Asn Met 

200 

His Gln Gln Pro
215 

Arg Leu Asp His 
230 

Phe Phe Arg Trp 
245 

Tyr Val Lys Lys 

Asp Ile Ser Glu

280 

Thr Ser Asp Ala 
295 

Lys Cys Ser Arg 
310 

Gln Cys Glu Arg
325

Gln Lys Asp Cys

Ser Val Val Lys 

360 

Met Gly Lys Val 
375 

Asp Leu Asp Asp 
390 











Ala 


Ser 


Val 


Phe




90 






Thr 


He 


Trp 



Leu 


105 








Glu 


Ala 


He 


Ala 


Asn 


Glu 


Asn 


Phe 








140 


Trp 


Glu 


Glu 


Gly 






155 




He 


Leu 


Gly 


Gly 




170 






Ala 


Gin 


He 


Asp 


185 








Cys 


Ala 


Val 


He 


Leu 


Gin 


Asp 


Arg 








220 


Asp 


Phe 


Gly 


Lys 






235 




Ala 


Lys 


Asp 


His 




250 






Gly Gly 


Ala 


Lys 


265 








Pro 


Lys 


Arg 


Val 


Glu 


Ala 


Ser 


He 








300 


His 


Val 


Gly 


Met 






315 




Met 


Asn 


Gin 


Asn 




330 






Leu 


Glu 


Cys 


Phe 


345 








Lys 


Ala 


Tyr 


Gin 


Pro 


Asp 


Ala 


Cys 








380 


Cys 


He 


Phe 


Glu 




395 





80 

Leu Gly Trp Ala 
95 

Phe Gly Ala Ala 
110 

His Thr Val Pro 
125 

Pro Phe Asn Asp 

Lys Met Thr Ala 

160 

Ser Lys Val Arg 
175 

Pro Thr Pro Val 
190 

Asp Gly Asn Ser 
205 

Met Phe Lys Phe 

Val Thr Lys Gln

240 

Val Val Glu Val 
255 

Lys Arg Pro Ala 
270 

Arg Glu Ser Val 
285 

Asn Tyr Ala Asp 

Asn Leu Met Leu 

320 

Ser Asn Ile Cys
335 

Pro Val Ser Glu 
350 

Lys Leu Cys Tyr 
365 

Thr Ala Cys Asp 
Gln



<210> 95 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 335 GCT



<400> 95 



Thr 


Ala 


Gly 


Phe 


Tyr 


Glu 


He 


Val 


He 


Lys 


Val 


Pro Ser Asp 


Leu Asp 


1 






5 










10 






15 




Glu 


His 


Leu 


Pro 


Gly 


He 


Ser 


Asp 


Ser 


Phe 


Val 


Asn Trp Val Ala 


Glu 








20 








25 






30 




He 


Lys 


Glu 


Trp 


Glu 


Leu 


Pro 


Pro 


Asp 


Ser 


Asp 


Met 


Asp Leu Asn 


Leu 




35 










40 








45 






Glu 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 


Ala 


Glu 


Lys 


Leu 


Gin Arg Asp 


Phe 


Leu 




50 










55 










60 






Thr Glu Trp Arg 


Arg 


Val 


Ser 


Lys 


Ala 


Pro 


Glu 


Ala Leu Phe 


Phe 


Val 


65 










70 










75 






80 
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Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu 

85 90 95 

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile

100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu

115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly

130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu

165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His

180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn

195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr

210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Ala Ala

325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp

355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala

370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln

450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535



<210> 96 
<211> 312 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 335 GCT 
<400> 96 

Leu Val Gly Trp Leu Val Asp

10 15 
Ser Tyr Ile Ser Phe Asn Ala Ala
25 30
Ala Ala Leu Asp Asn Ala Gly Lys

45

Pro Asp Tyr Leu Val Gly Gln Gln

60

Arg Ile Tyr Lys Ile Leu Glu Leu

75 80
Ala Ser Val Phe Leu Gly Trp Ala

90 95
Thr Ile Trp Leu Phe Gly Ala Ala
105 110

Glu Ala Ile Ala His Thr Val Pro

125

Asn Glu Asn Phe Pro Phe Asn Asp

140

Trp Glu Glu Gly Lys Met Thr Ala
155 160
Ile Leu Gly Gly Ser Lys Val Arg

170 175
Ala Gln Ile Asp Pro Thr Pro Val
185 190
Cys Ala Val Ile Asp Gly Asn Ser

205

Leu Gln Asp Arg Met Phe Lys Phe

220

Asp Phe Gly Lys Val Thr Lys Gln
235 240
Ala Lys Asp His Val Val Glu Val

250 255
Gly Gly Ala Lys Lys Arg Pro Ala
265 270
Pro Lys Arg Val Arg Glu Ser Val

285

Glu Ala Ser Ile Asn Tyr Ala Asp

300 



Met 


Glu 


Leu Val


Gly Trp 

c 


Leu 


Val 


Gin Trp 


Ile Gln


Glu 


Asp 


Gin 


Ala 
















Ser 


Asn 


OCX. r\J- y 


Ser 


Gin 


He 


Lys














40 


He 


Met 


Dei UC5U. 


Thr 


Lys 


Thr 


Ala 




50 








55 




Pro 


Val 




He 


Ser 


w w J- 


Asn 


65 








70 






Asn Gly 




Pro 


Gin 


Tyr


Ala 








85 








Thr 


Lys 


Lys Phe 


Gly 


Lys 


Arg 


Asn 




100 










Thr 


Thr 


Gly Lys 


Thr 


Asn 


He 


Ala 






115 








120 


Phe 


Tyr 


Gly Cys 


Val 


Asn 


Trp 


Thr 




130 






135 




Cys 


Val 


Asp Lys 


Met 


Val 


He 


Trp 


145 








150 






Lys 


Val 


Val Glu 


Ser 


Ala 


Lys 


Ala 






165 








Val 


Asp 


Gin Lys 


Cys 


Lys 


Ser 


Ser 






180 










He 


Val 


Thr Ser 


Asn 


Thr 


Asn 


Met 






195 








200 


Thr 


Thr 


Phe Glu 


His 


Gin 


Gin 


Pro 




210 








215 




Glu 


Leu 


Thr Arg 


Arg 


Leu 


Asp 


His 


225 








230 






Glu 


Val 


Lys Asp 


Phe 


Phe 


Arg 


Trp 








245 








Glu 


His 


Glu Phe 


Tyr 


Val 


Lys 


Lys 






260 










Pro 


Ser 


Asp Ala 


Asp 


He 


Ser 


Glu 






275 








280 


Ala 


Gin 


Pro Ser 


Thr 


Ser 


Asp 


Ala 




290 








295 




Arg 


Leu 


Ala Arg 


Gly 


His 


Ser 


Leu 


305 








310 







<210> 97 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 337 GCT 

<400> 97

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp

1 5 10 15 

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu

20 25 30 
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Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu lie 

35 40 45 

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu

50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu

85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile

100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu

115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly

130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu

165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His

180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn

195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr

210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

325 330 335

Ala Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp

355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala

370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln

450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
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515 






520 


Arg 


Tyr 


Gin 


Asn 


Lys 


Cys Ser Arg 


530 








535 


Phe 


Pro 


Cys 


Arg 


Gin 


Cys Glu Arg 


545 








550 


Phe 


Thr 


His 


Gly 


Gin 


Lys Asp Cys 










565 




Ser 


Gin 


Pro 


Val 


Ser 


Val Val Lys 








580 






He 


His 


His 


He 


Met 


Gly Lys Val 






595 






600 


Leu 


Val 


Asn 


Val 


Asp 


Leu Asp Asp 




610 








615 



<210> 98 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 



<400> 98 



Met 


Glu 


Leu Val 


Gly 
5 


Trp 


Leu 


Val 


1 
Gin 


Trp 


He Gin 
20 


Glu 


Asp 


Gin 


Ala 


Ser 


Asn 


Ser Arg 


Ser 


Gin 


He 


Lys 






35 








40 


He 


Met 


Ser Leu 


Thr 


Lys 


Thr 


Ala 




50 








55 




Pro 


Val 


Glu Asp 


He 


Ser 


Ser 


Asn 


65 








70 




Ala 


Asn 


Gly 


Tyr Asp 


Pro 


Gin 


Tyr 








85 








Thr 


Lys 


Lys Phe 


Gly 


Lys 


Arg 


Asn 




100 










Ala 


Thr 


Gly Lys 


Thr 


Asn 


He 


Ala 






115








120 


Phe 


Tyr 


Gly Cys 


Val 


Asn 


Trp 


Thr 




130 






135 




Cys 


Val 


Asp Lys 


Met 


Val 


He 


Trp 


145 








150 




Ala 


Lys 


Val 


Val Glu 


Ser 


Ala 


Lys 






165 








Val 


Asp 


Gin Lys 


Cys 


Lys 


Ser 


Ser 




180 










He 


Val 


Thr Ser 


Asn 


Thr 


Asn 


Met 




195 








200 


Thr 


Thr 


Phe Glu 


His 


Gin 


Gin 


Pro 




210 








215 




Glu 


Leu 


Thr Arg 


Arg 


Leu 


Asp 


His 


225 








230 






Glu 


Val 


Lys Asp 


Phe 


Phe 


Arg 


Trp 






245 








Glu 


His 


Glu Phe 


Tyr 


Val 


Lys 


Lys 






260 










Pro 


Ser 


Asp Ala 


Asp 


He 


Ser 


Glu 






275 






280 


Ala 


Gin 


Pro Ser 


Thr 


Ser 


Asp 


Ala 


290 








295 











525 








His Val 


Gly 


Met 


Asn 


Leu 


Met 


Leu 






540 










Met Asn 


Gin 


Asn 


Ser 


Asn 


He 


Cys 




555 










560 


Leu Glu 


Cys 


Phe 


Pro 


Val 


Ser 


Glu 


570 








575 




Lys Ala Tyr Gin 


Lys 


Leu 


Cys 


Tyr 


585 








590 






Pro Asp 


Ala 


Cys 


Thr 


Ala 


Cys 


Asp 








605 








Cys He 


Phe 


Glu 


Gin 









620 



337 GCT 



Asp 


Lys 


Gly 


He 


Thr 


Ser 


Glu 


Lys 


10 










15 




Ser 


Tyr 


He 


Ser 


Phe 


Asn 


Ala 


Ala 


25 








30 






Ala 


Ala 


Leu Asp 


Asn 


Ala 


Gly Lys 










45 








Pro 


Asp 


Tyr 


Leu 


Val 


Gly 


Gin 


Gin 






60 










Arg 


He 


Tyr 


Lys 


He 


Leu 


Glu 


Leu 




75










80 


Ala 


Ser 


Val 


Phe 


Leu Gly 


Trp 


Ala 




90 










95 




Thr 


He 


Trp 


Leu 


Phe 


Gly 


Pro 


Ala 


105 








110 






Glu 


Ala 


He 


Ala 


His 
125 


Thr 


Val 


Pro 


Asn 


Glu 


Asn 


Phe 


Pro 


Phe 


Asn Asp 








140 










Trp 


Glu 


Glu Gly 


Lys 


Met 


Thr 


Ala 




155 










160 


He 


Leu Gly Gly 


Ser 


Lys 


Val 


Arg 




170 










175 




Ala 


Gin 


He 


Asp 


Pro 


Thr 


Pro 


Val 


185 








190 






Cys 


Ala 


Val 


He 


Asp Gly 


Asn 


Ser 








205 








Leu 


Gin 


Asp 


Arg 
220 


Met 


Phe 


Lys 


Phe 


Asp 


Phe 


Gly Lys 


Val 


Thr 


Lys 


Gin 




235 










240 


Ala 


Lys 


Asp 


His 


Val 


Val 


Glu 


Val 




250 








255 


Ala 


Gly Gly Ala Lys 


Lys Arg 


Pro 


265 










270 






Pro Lys 


Arg 


Val 


Arg 


Glu 


Ser 


Val 










285 








Glu 


Ala 


Ser 


He 


Asn Tyr 


Ala Asp 



300 
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Arg 


Tyr 


Gin 


Asn 


Lys 


Cys 


Ser Arg His 


305 








310 








Phe 


Pro 


Cys 


Arg 


Gin 


Cys 


Glu Arg 


Met 








325 










Phe 


Thr 


His 


Gly 


Gin Lys 


Asp 


Cys 


Leu 








340 










345 


Ser 


Gin 


Pro 


Val 


Ser 


Val 


Val 


Lys 


Lys 






355 










360 




Ile


His 


His 


Ile


Met 


Gly 


Lys 


Val 


Pro 




370 










375 






Leu 


Val 


Asn 


Val 


Asp 


Leu 


Asp Asp 


Cys 


385 










390 








<210> 99 














<211> 536 














<212> PRT 














<213> Artificial Sequence 






<220> 
















<223> Mutant rep protein: rep 68 33' 


<400> 99 














Thr 


Ala 


Gly 


Phe 


Tyr 
5 


Glu 


He 


Val 


He 


1 

Glu 




Leu 


Pro 


Gly 


Ile


Ser 


Asp 


Ser 








20 










25 


Lys 


Glu 


Trp 


Glu 


Leu 


Pro 


Pro 


Asp 


Ser 




35 










40 




Glu 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 


Ala 


Glu 




50 










55 






Thr 


Glu 


Trp 


Arg 


Arg 


Val 


Ser 


Lys 


Ala 


65 










70 








Gin 


Phe 


Glu 


Lys 


Gly 


Glu 


Ser 


Tyr 


Phe 








85 










Thr 


Thr 


Gly 


Val 


Lys 


Ser 


Met 


Val 


Leu 






100 










105 


Arg 


Glu 


Lys 


Leu 


Ile


Gin 


Arg 


Ile


Tyr 




115 










120 




Pro 


Asn 


Trp 


Phe 


Ala 


Val 


Thr 


Lys 


Thr 




130 








135 






Asn 


Lys 


Val 


Val 


Asp 


Glu 


Cys 


Tyr 


He 


145 








150 








Thr 


Gin 


Pro 


Glu 


Leu 


Gin 


Trp 


Ala 


Trp 










165 










Ser 


Ala 


Cys 


Leu 


Asn 


Leu 


Thr 


Glu 


Arg 






180 










185 


Leu 


Thr 


His 


Val 


Ser 


Gin 


Thr 


Gin 


Glu 






195 










200 




Pro 


Asn 


Ser 


Asp 


Ala 


Pro 


Val 


He 


Arg 




210 








215 






Met 


Glu 


Leu 


Val 


Gly 


Trp 


Leu 


Val 


Asp 


225 










230 








Gin 


Trp 


Ile


Gin 


Glu 


Asp 


Gin 


Ala 


Ser 








245 










Ser 


Asn 


Ser 


Arg 


Ser 


Gin 


He 


Lys 


Ala 








260 










265 


Ile


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 


Pro 






275 










280 




Pro 


Val 


Glu 


Asp 


Ile


Ser 


Ser 


Asn 


Arg 




290 








295 






Asn 


Gly 


Tyr 


Asp 


Pro 


Gin 


Tyr 


Ala 


Ala 



PCT/IB02/04087 






Val 


Gly Met Asn 


Leu 


Met 


Leu 




315 






320 


Asn 


Gin Asn Ser 


Asn 


He 


Cys 


330 






335 




Glu 


Cys Phe Pro 


Val 


Ser 


Glu 






350 






Ala 


Tyr Gin Lys 


Leu 


Cys 


Tyr 




365 








Asp 


Ala Cys Thr 


Ala 


Cys 


Asp 




380 








He 


Phe Glu Gin 









395 



GCT 



Lys 


Val Pro Ser Asp 


Leu 


Asp 


10 




15 




Phe 


Val Asn Trp Val 


Ala 


Glu 




30 






Asp 


Met Asp Leu Asn 


Leu 


He 


45 






Lys 


Leu Gin Arg Asp 


Phe 


Leu 




60 






Pro 


Glu Ala Leu Phe 


Phe 


Val 




75 




80 


His 


Met His Val Leu 


Val 


Glu 


90 




95 




Gly 


Arg Phe Leu Ser 


Gin 


He 


110 






Arg 


Gly He Glu Pro 


Thr 


Leu 




125 






Arg 


Asn Gly Ala Gly Gly 


Gly 


140 






Pro 


Asn Tyr Leu Leu 


Pro 


Lys 




155 




160 


Thr 


Asn Met Glu Gin Tyr 


Leu 


170 




175 




Lys 


Arg Leu Val Ala 


Gin 


His 


190 






Gin 


Asn Lys Glu Asn 


Gin 


Asn 




205 






Ser 


Lys Thr Ser Ala Arg 


Tyr 




220 






Lys 


Gly He Thr Ser 


Glu 


Lys 


235 




240 


Tyr 


He Ser Phe Asn 


Ala 


Ala 


250 




255 




Ala 


Leu Asp Asn Ala Gly 


Lys 




270 






Asp 


Tyr Leu Val Gly Gln


Gin 




285 






He 


Tyr Lys He Leu 


Glu 


Leu 




300 






Ser 


Val Phe Leu Gly Trp 


Ala 
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305 








310 






-L.li.JL 


J-ljr £J 


Lys Phe 


Gly 


Lys 


Arg 


Asn 








325 








Ala 


X ii-L. 


Gly Lys 


Thr 


Asn 


He 


Ala 






340 










Jrllci 


ryr 


Gly Cys 


Val 


Asn Trp 


Thr

X i 1 1. 






355 








360 

*J Vp/ V/ 


Cys 


Val


Asp Lys 


Met 


Val 


He 


Trp 


370 








375 




Lys 


Val 


Val Glu 


Ser 


Ala 


Lys 


Ala 


385 








390 






Val 


Asp 


Gin Lys 


Cys 


Lys 


Ser 


Ser 








405 








He 


Val 


Thr Ser 


Asn 


Thr 


Asn 


Met 






420 










Thr 


Thr 


Phe Glu 


His 


Gin 


Gin 


Pro 






435 








440 


Glu 


Leu 


Thr Arg Arg 


Leu Asp 


His 




450 








455 




Glu 


Val 


Lys Asp 


Phe 


Phe 


Arg 


Trp 


465 








470 






Glu 


His 


Glu Phe 


Tyr 


Val 


Lys 


Lys 








485 








Pro 


Ser 


Asp Ala Asp 


He 


Ser 


Glu 






500 










Ala 


Gin 


Pro Ser 


Thr 


Ser Asp 


Ala 






515 








520 


Arg 


Leu 


Ala Arg Gly 


His 


Ser 


Leu 




530 








535 





<210> 100 
<211> 312 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 



<400> 100 



Met 


Glu 


Leu 


Val 


Gly 


Trp 


Leu 


Val 


1 








5 








Gin 


Trp 


He 


Gin 
20 


Glu 


Asp 


Gin 


Ala 


Ser 


Asn 


Ser 


Arg 


Ser 


Gin 


He 


Lys 






35 








40 


He 


Met 
50 


Ser 


Leu 


Thr 


Lys 


Thr 
55 


Ala 


Pro 


Val 


Glu 


Asp 


He 


Ser 


Ser 


Asn 


65 








70 






Asn 


Gly 


Tyr 


Asp 


Pro 
85 


Gin 


Tyr 


Ala 


Thr 


Lys 


Lys 


Phe 
100 


Gly 


Lys 


Arg 


Asn 


Ala 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 






115 








120 


Phe 


Tyr 
130 


Gly 


Cys 


Val 


Asn 


Trp 
135 


Thr 


Cys 


Val 


Asp 


Lys 


Met 


Val 


He 


Trp 


145 










150 






Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 








165 














315 




320 


Thr 


He Trp Leu 


Phe 


Gly Pro Ala




330 




335 

*f ^# 




Ala He Ala 


His 


-L ll-L V Ct_L XT 1-\J 


345 






350 


Asn 


Glu Asn Phe 


Pro 


Phe Asn Asp 






365 


Trp 


Glu Glu Gly Lys 


Met Thr Ala 




380 






He 


Leu Gly Gly 


Ser 


Lys Val Arg 




395 




400 


Ala 


Gin He Asp 


Pro 


Thr Pro Val 




410 




415 


Cys 


Ala Val He 


Asp 


Gly Asn Ser 


425 






430 


Leu 


Gin Asp Arg Met 


Phe Lys Phe 






445 




Asp 


Phe Gly Lys 


Val 


Thr Lys Gin 




460 






Ala 


Lys Asp His 


Val 


Val Glu Val 




475 




480 


Gly 


Gly Ala Lys 


Lys 


Arg Pro Ala 




490 




495 


Pro 


Lys Arg Val Arg 


Glu Ser Val 


505 






510 


Glu 


Ala Ser He 


Asn 


Tyr Ala Asp 



525 



337 


GCT 










Asp 


Lys 


Gly 


He 


Thr 


Ser Glu Lys 




10 








15 


Ser 


Tyr He 


Ser 


Phe 


Asn Ala Ala 


25 










30 


Ala 


Ala 


Leu 


Asp Asn 


Ala Gly Lys 










45 




Pro 


Asp 


Tyr 


Leu 


Val 


Gly Gin Gin 








60 






Arg 


He 


Tyr 


Lys 


He 


Leu Glu Leu 






75 






80 


Ala 


Ser 


Val 


Phe 


Leu 


Gly Trp Ala 




90 








95 


Thr 


He 


Trp 


Leu 


Phe 


Gly Pro Ala 


105 








110 


Glu 


Ala 


He 


Ala 


His 


Thr Val Pro 










125 




Asn 


Glu Asn 


Phe 


Pro 


Phe Asn Asp 








140 






Trp 


Glu 


Glu 


Gly Lys 


Met Thr Ala 






155 






160 


He 


Leu Gly Gly 


Ser 


Lys Val Arg 




170 








175 
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225 



Asp 


Gin 


Lys 


Cys 


Lys 


Ser 


Ser 


Ala 


Gin He 


Asp 


Pro 


Thr 


Pro 


Val 






180 










185 






190 






Val 


Thr 


Ser 


Asn 


Thr 


Asn 


Met 


Cys 


Ala Val 


He 


Asp Gly Asn Ser 




195 










200 








205 








Thr 


Phe 


Glu 


His 


Gin 


Gin 


Pro 


Leu 


Gin Asp 


Arg 


Met 


Phe 


Lys 


Phe 


210 










215 








220 








Leu Thr Arg


Arg 


Leu 


Asp 


His 


Asp 


Phe Gly 


Lys 


Val 


Thr 


Lys 


Gin 


Val 








230 








235 








240 


Lys Asp 


Phe 


Phe 


Arg Trp Ala 


Lys Asp 


His 


Val 


Val 


Glu 


Val 


His 






245 










250 








255 




Glu 


Phe 


Tyr


Val 


Lys 


Lys 


Gly Gly Ala 


Lys 


Lys Arg 


Pro 


Ala 






260 










265 








270 






Ser 


Asp 


Ala 


Asp 


He 


Ser 


Glu 


Pro 


Lys Arg 


Val 


Arg 


Glu 


Ser 


Val 


Gin 


275 










280 








285 








Pro 


Ser 


Thr 


Ser 


Asp Ala Glu Ala Ser 


He 


Asn 


Tyr Ala Asp 


290 










295 








300 










Leu 


Ala 


Arg 


Gly His 


Ser 


Leu 
























310 





















305 

<210> 101 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 341 GCC
<400> 101 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Ala Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620

<210> 102 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 341 GCC 

<400> 102

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Ala Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
385 390 395

<210> 103 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 341 GCC 
<400> 103 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Ala Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535
<210> 104
<211> 312
<212> PRT
<213> Artificial Sequence
<220>
<223> Mutant rep protein: rep40 341 GCC
<400> 104

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Ala Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Leu Ala Arg Gly His Ser Leu
305 310



<210> 105
<211> 621
<212> PRT
<213> Artificial Sequence
<220>
<223> Mutant rep protein: rep78 342 GCC
<400> 105

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Ala Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620

<210> 106 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 342 GCC 
<400> 106 

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Ala Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
385 390 395



<210> 107 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 342 GCC 
<400> 107 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Ala Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535



<210> 108
<211> 312
<212> PRT
<213> Artificial Sequence
<220>
<223> Mutant rep protein: rep40 342 GCC
<400> 108



Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Ala Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Leu Ala Arg Gly His Ser Leu
305 310

<210> 109 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 347 GCA 
<400> 109 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ala Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620

<210> 110 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 347 GCA 
<400> 110 

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ala Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
385 390 395



<210> 111
<211> 536
<212> PRT
<213> Artificial Sequence
<220>
<223> Mutant rep protein: rep68 347 GCA
<400> 111

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ala Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 112
<211> 312
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep40 347 GCA
<400> 112

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ala Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Leu Ala Arg Gly His Ser Leu
305 310

<210> 113
<211> 621
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep78 350 AAT
<400> 113


Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Asn Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480



WO 03/018820 



PCT/IB02/04087 






Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620











<210> 114 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 350 AAT 
<400> 114 



Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Asn Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
385 390 395

<210> 115 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 350 AAT 
<400> 115 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Asn Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535











<210> 116 
<211> 312 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 350 AAT



<400> 116 



Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Asn Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Leu Ala Arg Gly His Ser Leu
305 310












<210> 117
<211> 621
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep78 350 GCT
<400> 117

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Ala Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620











<210> 118 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 350 GCT 
<400> 118 

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys






1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Ala Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
385 390 395

<210> 119 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 350 GCT 
<400> 119 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Ala Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 120
<211> 312
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep40 350 GCT
<400> 120

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Ala Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Leu Ala Arg Gly His Ser Leu
305 310

<210> 121
<211> 621
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep78 



<400> 121 



Thr 


Ala 


Gly Phe 


Tyr 
5 


Glu 


He 


Val 


1 


His 


Leu 


Pro 


Glv 


He 


Ser 


Asp 








20 










T.V«5 


V3X LX 


Trp 


Glu 


Leu 


Pro 


Pro 


Aso 

X XW 




35 










40 


Ol IX 


V3XXX 


Ala 


Pro 




Thr 


Val 


Ala 














55 




X 11X 


wX LX 


Trp Arg 


**xy 


Val 


Ser 

»wf V— -1* 


Lvs 


O — * 










70 






will 


rue 


Glu 


Lys 


Glv 


Glu 




xy x 










85 










xxxx 


Gly Val 




kj C JL. 


Met 


Val 








100 










Arg 


Glu 


Lys 


Leu 


Tie 

A. xc 


Gin 




lie 

JL -J* w 






115 










120 


Pro 


Asn 


Trp 


Phe 


Ala 


Val 


Th*z* 


IjVS 




130 










135 




Asn 


Lys 


Val 


Val 




Glu 




TV5T 

A y- L 


145 










150 






Thr 


Gin 


Pro 


Glu 


Leu 

-fpJ W U 


Gin 

will 


Trn 

XX£J 


Ala 










165 








Ser 


Ala 


Cys 


Leu 


AcjTI 




Thr 


Glu 






180 










Leu 


Thr 


His 


Val 


Ser 


Gin 


Thr 


Gin 






195 






• 




200 


Pro 


Asn 


Ser Asp 


Ala 


Pro 


Val 


He 




210 










215 




Met 


Glu 


Leu 


Val 


Glv 


Tr*D 


Leu 


Val 


225 










230 






Gin 


Trp 


He 


Gin 


Glu 




Gin 


Ala 










245 








Ser 


Asn 


Ser Arg 


Ser 


Gin 


He 


Lys 








260 










lie 


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 






275 










280 


Pro 


Val 


Glu Asp 


He 


Ser 


Ser 


Asn 




290 










295 




Asn Gly 


Tyr Asp 


Pro 


Gin 


Tyr 


Ala 


305 










310 






Thr 


Lys 


Lys 


Phe 


Gly 


Lys 


Arg 


Asn 










325 








Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 








340 










Phe 


Ala 


Gly 


Cys 


Val 


Asn 


Trp 


Thr 






355 










360 


Cys 


Val 


Asp 


Lys 


Met 


Val 


He 


Trp 




370 










375 




Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 


385 










390 






Val 


Asp 


Gin 


Lys 


Cys 


Lys 


Ser 


Ser 










405 








He 


Val 


Thr 


Ser 


Asn 


Thr 


Asn 


Met 








420 










Thr 


Thr 


Phe 


Glu 


His 


Gin 


Gin 


Pro 






435 










440 


Glu 


Leu 


Thr 


Arg 


Arg 


Leu 


Asp 


His 



354 GCC 



lie juys 


Val 


Pro 


Ser 


Asp Leu 


Asp 


i n 

iw 








15 




ber Pne 


Val 


Asn 


Trp 


Val Ala 


Glu 


ZD 








30 




Ser Asp 


Met Asp 


Leu 


Asn Leu 


He 








45 






Glu Lys 


Leu Gin Arg 


Asp Phe 


Leu 






60 








Ala Pro 


Glu 


Ala 


Leu 


Phe Phe 


Val 




75 








80 


Phe His 


Met 


His 


Val 


Leu Val 


Glu 










95 




Leu Gly 


Arg 


Phe 


Leu 


Ser Gin 


He 


105 








110 




Tyr Arg 


Gly 


He 


Glu 


Pro Thr 


Leu 








125 






Thr Arg 


Asn Gly Ala 


Gly Gly Gly 






140 








He Pro 


Asn Tyr 


Leu 


Leu Pro 


Lys 




155 








160 


Trp Thr 


Asn 


Met 


Glu 


Gin Tyr 


Leu 


X / u 








175 




Arg Lys 


Arg 


Leu 


Val 


Ala Gin 


His 


185 








190 




Glu Gin 


Asn 


Lys 


Glu 


Asn Gin 


Asn 






205 






Arg ser 


Lys 


Thr 


Ser 


Ala Arg 


Tyr 






220 








Asp iiys 


Gly 


He 


Thr 


Ser Glu 


Lys 




235 








240 


ser Tyr 


He 


Ser 


Phe 


Asn Ala 


Ala 










255 




Ala Ala 


Leu Asp 


Asn 


Ala Gly Lys 


265 








270 




Pro Asp 


Tyr 


Leu 


Val 


Gly Gin Gin 








285 






Arg He 


Tyr 


Lys 


He 


Leu Glu 


Leu 






300 








Ala Ser 


Val 


Phe 


Leu 


Gly Trp Ala 




315 








320 


Thr He 


Trp 


Leu 


Phe 


Gly Pro 


Ala 


330 








335 




Glu Ala 


He 


Ala 


His 


Thr Val 


Pro 


345 








350 




Asn Glu 


Asn 


Phe 


Pro 


Phe Asn Asp 








365 






Trp Glu 


Glu Gly Lys 


Met Thr 


Ala 






380 








He Leu 


Gly Gly 


Ser 


Lys Val 


Arg 




395 








400 


Ala Gin 


He 


Asp 


Pro 


Thr Pro 


Val 


410 








415 




Cys Ala 


Val 


He Asp 


Gly Asn 


Ser 


425 








430 




Leu Gin 


Asp 


Arg 


Met 


Phe Lys 


Phe 








445 






Asp Phe 


Gly 


Lys 


Val 


Thr Lys 


Gin 
450










455 




460 


Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val

465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu

530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys

545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu

565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr

580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp

595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln

610 615 620



<210> 122 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 354 GCC 



<400> 122 



Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys

1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala

85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

115 120 125

Phe Ala Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp

130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala

145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg

165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln

225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val

245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu

305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys

325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu

340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr

355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp

370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln

385 390 395



<210> 123 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 354 GCC 



<400> 123 



Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp

1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu

20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile

35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu

50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val

65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu

85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile

100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu

115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly

130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys

145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu

165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His

180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn

195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr

210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys

225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

245





7\ e« 
Abil 


Ser Arg Ser 


Gin 


He 


Lys 


Ala 






260 








265 


Tip 




Ser Leu Thr Lys 


Thr 


Ala 


Pro 






275 






280 




riO 


V a. J. 


Glu Asp He 


Ser 


Ser Asn Arg 




i o rt 

290 






295 






Asn 


QjXy 


Tyr Asp Pro 


Gin 


Tyr Ala Ala 


305 






310 








Thx 


Lys 


Lys Phe Gly Lys 


Arg Asn 


Thr 






325 










Thr 


Thr 


Gly Lys Thr 


Asn 


He 


Ala 


Glu 






340 








345 


Pne 


Ala. 


Gly Cys Val 


Asn 


Trp JThr Asn 






355 






360 




Cys 


Val 


Asp Lys Met 


Val 


He 


Trp 


Trp 




370 






375 






Lys 


Val 


Val Glu Ser 


Ala 


Lys 


Ala 


He 


385 






390 








Val 


Asp 


Gin Lys Cys 


Lys 


Ser 


Ser 


Ala 






405 










He 


Val 


Thr Ser Asn 


Thr 


Asn 


Met 


Cys 






420 








425 


Thr 


Thr 


Phe Glu His 


Gin 


Gin 


Pro 


Leu 






435 






440 




Glu 


Leu 


Thr Arg Arg Leu 


Asp 


His 


Asp 




450 






455 






Glu 


Val 


Lys Asp Phe 


Phe 


Arg 


Trp 


Ala 


465 






470 








Glu 


His 


Glu Phe Tyr Val 


Lys 


Lys 


Gly 






485 










Pro 


Ser 


Asp Ala Asp 


He 


Ser 


Glu 


Pro 






500 








505 


Ala 


Gin 


Pro Ser Thr 


Ser 


Asp 


Ala 


Glu 






515 






520 




Arg 


Leu 


Ala Arg Gly His 


Ser 


Leu 






530 






535 







<210> 124 
<211> 312 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 354 



<400> 124 



Met 


Glu 


Leu 


Val 


Gly 


Trp 


Leu 


Val 


Asp 


1 








5 










Gin 


Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ala 


Ser 






20 








25 


Ser 


Asn 


Ser 
35 


Arg 


Ser 


Gin 


He 


Lys 
40 


Ala 


He 


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 


Pro 




50 








55 






Pro 


Val 


Glu 


Asp 


He 


Ser 


Ser 


Asn Arg 


65 










70 








Asn Gly 


Tyr 


Asp 


Pro 


Gin 


Tyr 


Ala 


Ala 










85 










Thr 


Lys 


Lys 


Phe 
100 


Gly 


Lys 


Arg 


Asn 


Thr 
105 
250








255 




Ala 


Leu Asp 


Asn 


Ala 


Gly 


Lys 








270 






Asp 


Tyr Leu 


Val 


Gly Gin 


Jim* 

Gin 






285 








He 


Tyr Lys 


He 


Leu 


Glu 


Leu 




300 










Ser 


Val Phe 


Leu Gly Trp 


Ala 




315 








320 


He 


Trp Leu 


Phe 


Gly 


Pro 


Ala 


330 








335 




Ala 


He Ala 


His 


Thr 


Val 


Pro 








350 






Glu 


Asn Phe 


Pro 


Phe 


Asn 


Asp 






365 








Glu Glu Gly 


Lys 


Met 


Thr 


Ala 




380 










Leu Gly Gly 


Ser 


Lys 


Val 


Arcr 




395 








400 


Gin 


He Asp 


Pro 


Thr 


Pro 


Val 


410 






415 




Ala 


Val He 


Asp 


Gly Asn 


Ser 








430 






Gin Asp Arg 


Met 


Phe 


Lys 


Phe 






445 








Phe 


Gly Lys 


Val 


Thr 


Lys 


Gin 




460 










Lys 


Asp His 


Val 


Val 


Glu 


Val 


475 








480 


Gly Ala Lys 


Lys 


Arg 


Pro 


Ala 


490 








495 




Lys Arg Val 


Arg 


Glu 


Ser 


Val 








510 






Ala 


Ser He 


Asn 


Tyr Ala 


Asp 






525 








GCC 












Lys 


Gly He 


Thr 


Ser 


Glu 


Lys 


10 








15 




Tyr 


He Ser 


Phe 


Asn 


Ala 


Ala 






30 






Ala 


Leu Asp 


Asn 


Ala 


Gly 


Lys 






45 








Asp 


Tyr Leu 


Val 


Gly Gin 


Gin 




60 










He 


Tyr Lys 


He 


Leu 


Glu 


Leu 




75 








80 


Ser 


Val Phe 


Leu 


Gly Trp 


Ala 


90 








95 




He 


Trp Leu 


Phe 


Gly 


Pro 


Ala 



110 
Thr


Thr 


Gly Lys 


Thr 


Asn 


He 


Ala 






115 










120 


Phe 


Ala 


Gly Cys 


Val Asn Trp Thr 




130 










135 




Cvs 


Val 


Asp Lys 


Met 


Val 


He 


Trp 


145 










150 






Lys 


Val 


Val 


Glu 


Ser Ala Lys Ala 








165 








Val 


Asp 


Gin Lys 


Cys 


Lys 


Ser 


Ser 








180 










He 


Val 


Thr 


Ser 


Asn 


Thr 


Asn 


Met 






195 










200 


Thr 


Thr 


Phe 


Glu 


His 


Gin 


Gin 


Pro 




210 










215 




Glu 


Leu 


Thr Arg 


Arg 


Leu Asp 


His 


225 










230 






Glu 


Val 


Lys 


Asp 


Phe 


Phe 


Arg 


Trp 










245 








Glu 


His 


Glu 


Phe 


Tyr Val 


Lys 


Lys 








260 










Pro 


Ser 


Asp Ala 


Asp 


He 


Ser 


Glu 






275 










280 


Ala 


Gin 


Pro 


Ser 


Thr 


Ser 


Asp 


Ala 




290 










295 




Arg 


Leu 


Ala Arg 


Gly His 


Ser 


Leu 


305 










310 






<210> 125 












<211> 621 












<212> PRT 












<213> Artificial Sequence 




<220> 














<223> Mutant rep protein: rep78 


<400> 125 












Thr 


Ala 


Gly 


Phe 


Tyr 


Glu 


He 


Val 


1 








5 








Glu 


His 


Leu 


Pro 


Gly 


He 


Ser 


Asp 








20 










Lys 


Glu 


Trp 


Glu 


Leu 


Pro 


Pro 


Asp 






35 










40 


Glu 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 


Ala 




50 










55 




Thr 


Glu 


Trp 


Arg 


Arg 


Val 


Ser 


Lys 


65 










70 






Gin 


Phe 


Glu 


Lys 


Gly 


Glu 


Ser 


Tyr 










85 








Thr 


Thr 


Gly 


Val 


Lys 


Ser 


Met 


Val 








100 










Arg 


Glu 


Lys 


Leu 


He 


Gin 


Arg 


He 






115 










120 


Pro 


Asn 


Trp 


Phe 


Ala 


Val 


Thr 


Lys 




130 










135 




Asn 


Lys 


Val 


Val 


Asp 


Glu 


Cys 


Tyr 


145 










150 






Thr 


Gin 


Pro 


Glu 


Leu 


Gin 


Trp 


Ala 










165 








Ser 


Ala 


Cys 


Leu 


Asn 


Leu 


Thr 


Glu 








180 










Leu 


Thr 


His 


Val 


Ser 


Gin 


Thr 


Gin 



Glu 


Ala 


lie 


Ala Hxs Thr Val Pro 










Asn 


Glu 


Asn 


Phe Pro Phe Asn Asp 








140 


Trp Glu 


Glu 


Gly Lys Met Thr Ala 






155 


160 


He 


Leu 


Gly 


Gly Ser Lys Val Arg 




170 




175 


Ala 


Gin 


He 


Asp Pro Thr Pro Val 


185 






190 


Cys 


Ala 


Val 


He Asp Gly Asn Ser 








205 


Leu 


Gin 


Asp 


Arg Met Phe Lys Phe 








220 


Asp 


Phe 


Gly 


Lys Val Thr Lys Gin 






235 


240 


Ala 


Lys 


Asp 


His Val Val Glu Val 




250 




255 


Gly Gly 


Ala 


Lys Lys Arg Pro Ala 


265 






270 


Pro 


Lys 


Arg 


Val Arg Glu Ser Val 








285 


Glu 


Ala 


Ser 


He Asn Tyr Ala Asp 



300 



363 GCC 



He Lys 


Val 


Pro 


Ser Asp 


Leu 


Asp 


10 








15 




Ser Phe 


Val 


Asn 


Trp Val 


Ala 


Glu 


25 






30 






Ser Asp 


Met 


Asp 


Leu Asn 


Leu 


He 








45 






Glu Lys 


Leu 


Gin Arg Asp 


Phe 


Leu 






60 








Ala Pro 


Glu 


Ala 


Leu Phe 


Phe 


Val 




75 








80 


Phe His 


Met 


His 


Val Leu 


Val 


Glu 


90 








95 




Leu Gly 


Arg 


Phe 


Leu Ser 


Gin 


He 


105 






110 






Tyr Arg 


Gly 


He 


Glu Pro 


Thr 


Leu 






125 






Thr Arg 


Asn 


Gly Ala Gly Gly Gly 






140 








He Pro 


Asn 


Tyr 


Leu Leu 


Pro 


Lys 




155 






160 


Trp Thr 


Asn 


Met 


Glu Gin Tyr 


Leu 


170 








175 




Arg Lys 


Arg 


Leu 


Val Ala 


Gin 


His 


185 






190 






Glu Gin 


Asn 


Lys 


Glu Asn 


Gin 


Asn 
195

J- -/ —J 










200 




205 








Pro 


Asn 




Asp Ala 


nu 


Val 


He 


AT*a S^t* TtVfi Tht* 


Ser 


Ala 


Arg Tyr 




210 










215 




220 










Met 


Glu 


IicU 


Val 


Giy 


lip 


Leu 


Val 


nop iiys Vj J. jr IXC 


Thr 


Ser 




jjyb 


225 










230 






235 








240 


Gin 


Trp 


Tin 

lie 


Gin 






Gin 


Ala 


ocl l y i lie ocx 


Phe 


Asn 


Aia 


TV T -a 

Aia 








245 








250 






255 




Ser 


Asn 


Ser 


Arg 
260 


ber 


isJ.ll 


He 


Lys 


Ala Ala lieu ASp 

265 


Asn 


Ala 
270 


Giy 


Lys 


lie 


Met 


Ser 


Leu 


Thr 


uys 


Thr Ala 


±rro Asp iyr i>eu 


Val 


Gly 


Gin 


Gin 
















280 




285 








Pro 


Val 


Glu 


Asp 


He 


Ser 


Ser 


Asn 


Arg He Tyr Lys 


He 


Leu 


GlU 


.Leu 




290 








295 














Asn Gly 


Tyr 


Asp 


Pro 


Gin 


Tyr Ala 


Aia ber vai Fne 


Leu Gly 


Trp 


Ala 


305 










310 






315 








o n 


Thr 


Lys 


Lys 


Phe 


Gly 


Lys 


Arg Asn 


Thr He Trp Leu 


Phe Gly 


Pro 


Ala 










325 














335 




Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 


Giu Aia lie Aia 


His 


Thr 


Val 


Pro 






340 










j4d 




350 






Phe 


Tyr 


Gly 


Cys 


Val 


Asn 


Trp 


Thr 


Asn Glu Aia Pne 


Pro 


Phe 


Asn 


Asp 




Job 










360 




365 








Cys Val 


Asp 


Lys 


Met 


Val 


He 


Trp 


Trp GlU G1U Gly 


Lys 


Met 


Thr 


Ala 




370 










375 




ion 
Joy 










Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 


He Leu Gly Gly 


Ser 


Lys 


Val 


Arg 


385 










390 














400 


Val 


Asp 


Gin 


Lys 


Cys 
405 


Lys 


Ser 


Ser 


Ala Gin lie Asp 
410 


Pro 


Thr 


Pro 
415 


Val 


He 


Val 


Thr 


Ser 
420 


Asn 


Thr 


Asn 


Met 


Cys Ala val lie 


Asp 


Gly 
430 


Asn 


Ser 


Thr 


Thr 


Fne 

ft jD 


Glu 


His 


Gin 


Gin 


Pro 
440 


jueu Gin Asp Arg 


Met 
445 


Phe 


Lys 


Phe 


Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln

450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val

465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu

530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys

545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu

565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr

580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp

595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln

610 615 620











<210> 126 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 363 GCC
<400> 126

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys

1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala

85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Ala Phe Pro Phe Asn Asp

130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg

165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val

245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys

325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu

340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr

355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp

370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
385 390 395

<210> 127 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 363 GCC



<400> 127 
Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp

1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu

20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile

35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu

50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu

85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile

100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu

115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly

130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu

165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His

180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn

195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr

210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Ala Phe Pro Phe Asn Asp

355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala

370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln

450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485

Pro Ser Asp Ala Asp Ile Ser

500

Ala Gln Pro Ser Thr Ser Asp
515

Arg Leu Ala Arg Gly His Ser
530 535

<210> 128 
<211> 312 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 363 GCC 



<400> 128 



Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys

1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala

20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys

35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln

50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu

65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala

85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala

100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro

115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Ala Phe Pro Phe Asn Asp

130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala

145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg

165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val

180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser

195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe

210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln

225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val

245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala

260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val

275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp

290 295 300

Arg Leu Ala Arg Gly His Ser Leu

305 310













<210> 129 
<211> 621 
<212> PRT 



490 

Glu Pro Lys 

505 
Ala Glu Ala 
520 
Leu 



Arg Val Arg 

Ser Ile Asn
525 



495 
Glu Ser Val 
510 

Tyr Ala Asp 
<213> Artificial Sequence
<220> 

<223> Mutant rep protein: rep78 364 GCT

<400> 129

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Ala Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620









<210> 130 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 364 GCT 



<400> 130 



Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Ala Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
385 390 395









<210> 131 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 364 GCT



<400> 131 



Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Ala Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535











<210> 132 
<211> 312 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 364 GCT 



<400> 132 



Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Ala Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Leu Ala Arg Gly His Ser Leu
305 310



<210> 133 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 367 GCC 



<400> 133 



Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Ala Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620

<210> 134
<211> 397
<212> PRT

<213> Artificial Sequence
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<220> 

<223> Mutant rep protein: rep52 367 GCC 



<400> 134 



Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Ala Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
385 390 395











<210> 135 

<211> 536 

<212> PRT 

<213> Artificial Sequence 
<220> 
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<223> Mutant rep protein: rep68 367 GCC 
<400> 135 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Ala Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 136 
<211> 312 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 367 GCC 



<400> 136 



Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Ala Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Leu Ala Arg Gly His Ser Leu
305 310















Ala Lys Asp His 
475 

Gly Gly Ala Lys 
490 

Pro Lys Arg Val 
505 

Glu Ala Ser Ile



Val Val Glu Val 

480 

Lys Arg Pro Ala 
495 

Arg Glu Ser Val 
510 

Asn Tyr Ala Asp 
525 
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<210> 137 
<211> 621 
<212> PRT 

<213> Artificial Sequence 

<220>

<223> Mutant rep protein: rep78 370 GCC

<400> 137 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Ala Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620









<210> 138 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 370 GCC 



<400> 138 



Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Ala Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
385 390 395










<210> 139
<211> 536
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep68 370 GCC

<400> 139


























Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205







Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Ala Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 140
<211> 312
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep40 370 GCC

<400> 140

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80


Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Ala Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Leu Ala Arg Gly His Ser Leu
305 310












<210> 141
<211> 621
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep78 376 GCG

<400> 141


















Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Ala Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620









<210> 142
<211> 397
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep52 376 GCG

<400> 142

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Ala Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
385 390 395









<210> 143
<211> 536
<212> PRT
<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep68 376 GCG

<400> 143

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Ala Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535















<210> 144 
<211> 312 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 376 GCG



<400> 144 



Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Ala Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Leu Ala Arg Gly His Ser Leu
305 310

<210> 145 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 381 GCG



<400> 145 



Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320


Thr 


Lys 


Lys 


Phe 


Gly 


Lys 


Arg 


Asn 


Thr 


He 


Trp 


Leu 


Phe 


Gly Pro Ala 










325 










330 










335. 


Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 


Glu 


Ala 


He 


Ala 


His 


Thr 


Val Pro 






340 










345 










350 




Phe 


Tyr 


Gly 


Cys 


Val 


Asn 


Trp 


Thr 


Asn 


Glu 


Asn 


Phe 


Pro 


Phe Asn Asp 




355 










360 










365 






Cys 


Val 


Asp 


Lys 


Met 


Val 


He 


Trp 


Trp 


Glu 


Glu 


Gly Ala 


Met 


Thr Ala 


370 










375 










380 








Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 


He 


Leu 


Gly 


Gly Ser 


Lys Val Arg 



WO 03/018820 



PCT/IB02/04087 



-177- 



385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620









<210> 146 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 381 GCG



<400> 146 



Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Ala Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175






Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asn His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
385 390 395












<210> 147
<211> 536
<212> PRT
<213> Artificial Sequence
<220>
<223> Mutant rep protein: rep68 381 GCG
<400> 147

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Ala Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535















<210> 148 
<211> 312 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 381 GCG 
<400> 148 

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Ala Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Leu Ala Arg Gly His Ser Leu
305 310















<210> 149 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 382 GCG 



<400> 149 



Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Ala Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620






<210> 150 

<211> 397 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 382 GCG



<400> 150 



Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Ala Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
385 390 395
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<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 382 GCG



<400> 151 



•"PI-* -y 
J. Il-L 


Ala 


Gly Phe Tyr Glu He Val 


-i. -l. uy o 


Val 


Pro 


Ser 


Asp 


Leu Asp 


J. 




5 




1 f) 










15 




rj-i cs 
HJLS 


Leu Pro Gly He Ser Asp 


OC--L. t lie 


Val 


Asn Trp 


Val 


Ala Glu 






20 




9 R 








30 




Jjjro 


m ii 


Trp Glu Leu Pro Pro Asp 


b7 ^ no 


Met 


Asp 


Leu 


Asn 


Leu He 




35 


40 








45 






\Jt JL Li 




Ala Pro Leu Thr Val 


Ala 


Cl~] ii T.-vrcj 
vjxu .u_y o 


Leu 


Gin 


Arg 


Asp 


Phe Leu 






55 








60 








Thr 


Glu 


Trp Arg Arg Val Ser 


Lys 


AT a Pro 


Glu 


Ala 


Leu 


Phe 


Phe Val 


65 




70 






75 








80 


Gin 


Phe 


Glu Lys Gly Glu Ser Tyr 


Phe His 

tliC 11XO 


Met 


His 


Val 


Leu 


Val Glu 






85 




on 










95 


Thr 


Thr 


Gly Val Lys Ser Met 


Val 




Arg 


Phe 


Leu 


Ser 


Gin He 






100 












110 




Arg 


Glu 


Lys Leu He Gin Arg 


He 


i y i. /i -i_ § 


Gly 


He 


Glu 


Pro 


Thr Leu 






115 


120 








125 






Pro 


Asn 


Trp Phe Ala Val Thr 


Lys 


Thr Ara 


Asn Gly Ala 


Gly Gly Gly 




130 


135 








140 








Asn 


Lys 


Val Val Asp Glu Cys 


Tyr 


Tip* Pro 


Asn Tyr 


Leu 


Leu 


Pro Lys 


145 




150 






155 








160 


Thr 


Gin 


Pro Glu Leu Gin Trp 


Ala 


Trp Thr 


Asn 


Met 


Glu 


Gin 


Tyr Leu 






165 














175 


Ser 


Ala 


Cys Leu Asn Leu Thr 


Glu 


Arcr Lvs 


Arg 


Leu 


Val 


Ala 


Gin His 






180 




185 








190 




Leu 


Thr 


His Val Ser Gin Thr 


Gin 


Glu Gin 


Asn Lys 


Glu 


Asn 


Gin Asn 






195 


200 








205 






Pro 


Asn 


Ser Asp Ala Pro Val 


He 


Arg Ser 


Lys 


Thr 


Ser 


Ala Arg Tyr 




210 


215 








220 








Met 


Glu 


Leu Val Gly Trp Leu Val 


Asp Lys 


Gly 


He 


Thr 


Ser Glu Lys 


225 




230 






235 








240 


Gin 


Trp 


He Gin Glu Asp Gin Ala 


Ser Tyr 


He 


Ser 


Phe 


Asn 


Ala Ala 




245 




250 










255 


Ser 


Asn 


Ser Arg Ser Gin He 


Lys 


Ala Ala 


Leu Asp 


Asn 


Ala Gly Lys 






260 




265 








270 




He 


Met 


Ser Leu Thr Lys Thr 


Ala 


Pro Asp 


Tyr 


Leu 


Val 


Gly Gin Gin 






275 


280 








285 






Pro 


Val 


Glu Asp He Ser Ser 


Asn 


Arg lie 


Tyr 


Lys 


He 


Leu 


Glu Leu 




290 


295 








300 








Asn Gly 


Tyr Asp Pro Gin Tyr Ala 


Ala Ser 


Val 


Phe 


Leu 


Gly Trp Ala 


305 




310 






315 








320 


Thr 


Lys 


Lys Phe Gly Lys Arg Asn 


Thr He 


Trp 


Leu 


Phe 


Gly 


Pro Ala 






325 




330 










335 


Thr 


Thr 


Gly Lys Thr Asn He Ala 


Glu Ala 


He 


Ala 


His 


Thr 


Val Pro 






340 




345 








350 




Phe 


Tyr 


Gly Cys Val Asn Trp 


Thr 


Asn Glu 


Asn 


Phe 


Pro 


Phe 


Asn Asp 






355 


360 








365 






Cys 


Val 


Asp Lys Met Val He 


Trp 


Trp Glu 


Glu Gly Lys 


Ala 


Thr Ala 




370 


375 








380 








Lys 


Val 


Val Glu Ser Ala Lys Ala 


He Leu 


Gly Gly Ser 


Lys 


Val Arg 


385 




390 






395 








400 


Val 


Asp 


Gin Lys Cys Lys Ser 


Ser 


Ala Gin 


He 


Asp 


Pro 


Thr 


Pro Val 






405 




410 










415 


He 


Val 


Thr Ser Asn Thr Asn 


Met 


Cys Ala 


Val 


He Asp 


Gly Asn Ser 
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420 



Thr 


Thr 


Phe 


Glu 


His 


Gin Gin 


Pro 






435 








440 


Glu 


Leu 


Thr Arg Arg 


Leu Asp 


His 




450 








455 




Glu 


Val 


Lys 


Asp 


Phe 


Phe Arg Trp 


465 










470 




Glu 


His 


Glu 


Phe 


Tyr 


Val Lys 


Lys 










485 






Pro 


Ser 


Asp 


Ala 


Asp 


lie Ser 


Glu 








500 








Ala 


Gin 


Pro 


Ser 


Thr 


Ser Asp 


Ala 






515 








520 


Arg 


Leu 


Ala Arg Gly His Ser 


Leu 




530 








535 





<210> 152 
<211> 312 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 



<400> 152 



Met 


Glu 


Leu 


Val 


Gly 


Trp 


Leu 


Val 


1 








5 








Gin 


Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ala 






20 










Ser 


Asn 


Ser 
35 


Arg 


Ser 


Gin 


He 


Lys 
40 


lie 


Met 
50 


Ser 


Leu 


Thr 


Lys 


Thr 
55 


Ala 


Pro 


Val 


Glu 


Asp 


He 


Ser 


Ser 


Asn 


65 








70 






Asn 


Gly 


Tyr 


Asp 


Pro 
85 


Gin 


Tyr 


Ala 


Thr 


Lys 


Lys 


Phe 
100 


Gly 


Lys 


Arg. 


Asn 


Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 






115 








120 


Phe 


Tyr 
130 


Gly 


Cys 


Val 


Asn 


Trp 
135 


Thr 


Cys 


Val 


Asp 


Lys 


Met 


Val 


He 


Trp 


145 










150 






Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 








165 








Val 


Asp 


Gin 


Lys 
180 


Cys 


Lys 


Ser 


Ser 


He 


Val 


Thr 
195 


Ser 


Asn 


Thr 


Asn 


Met 
200 


Thr 


Thr 
210 


Phe 


Glu 


His 


Gin 


Gin 
215 


Pro 


Glu 


Leu 


Thr 


Arg 


Arg 


Leu 


Asp 


His 


225 










230 






Glu 


Val 


Lys 


Asp 


Phe 
245 


Phe 


Arg 


Trp 


Glu 


His 


Glu 


Phe 
260 


Tyr 


Val 


Lys 


Lys 


Pro 


Ser 


Asp 
275 


Ala 


Asp 


He 


Ser 


Glu 
280 
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425 






430 








Asp Arg 


Met 


Phe 


Lvs 


Phe 






445 








Astj Phe 


Gly Lys 


Val 


Thr 


Lvs 


Gin 




460 












Asp His 


Val 


Val 


Gin 


Val 




475 








a fl n 




Ala Lys 


Lys 


ZiTVT 


ri.u 


ai a 










ft i?D 




Pro Lys 


Arg Val Arg 




uCl 


Val 

V ax 


505 














Ser He 


Asn 


xyir 


7V "I -a 


A an 






525 








3 82 GCG 












Asp ljys 


Gly He 


Thr 


Got* 
o fcr -L 


uJLU 


jjys 


1U 












Ser Tyr 


He Ser 


Phe 


Scn 


A "J a 


Ala 

rvi cx 


AO 












Si -s A "1 -a 


Leu Asp 


Asn 


Ala 


Glv 


Lvs 






45 








JTJl u no^ 


Tyr Leu 


Val 


Glv 


Gin 


Gin 


60 












Tyr Lys 


He 


Leu 


Glu 

w -Jw 


Leu 




75 








80 


AT a Qpr 


Val Phe 


Leu 


Glv 


Tro 


Ala 


on 












Thr He 


Trp Leu 


Phe 


Glv 


Pro 


Ala 


105 






llu 






Glu Ala 


He Ala 


His 


Thr 


Val 


Pro 






125 








Asn Glu 


Asn Phe 


Pro 


Phe 


Asn 


Asp 




140 










Trp Glu 


Glu Gly Lys 


Ala 


Thr 


Ala 


155 








160 


Ile Leu 


Gly Gly 


Ser 


Lys 


Val 


Arg 


170 






175 




Ala Gln 


Ile Asp 


Pro 


Thr 


Pro 


Val 


185 




190 






Cys Ala 


Val Ile 


Asp 


Gly 


Asn 


Ser 




205 








Leu Gln 


Asp Arg 


Met 


Phe 


Lys 


Phe 




220 










Asp Phe 


Gly Lys 


Val 


Thr 


Lys 


Gln 




235 








240 


Ala Lys 


Asp His 


Val 


Val 


Glu 


Val 


250 






255 




Gly Gly 


Ala Lys 


Lys 


Arg 


Pro 


Ala 


265 






270 






Pro Lys 


Arg Val 


Arg 


Glu 


Ser 


Val 




285 
Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp 

290 295 300 

Arg Leu Ala Arg Gly His Ser Leu 
305 310 

<210> 153 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 389 GCG 



<400> 153 



Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp 
1 5 10 15 

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu 
20 25 30 

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile 
35 40 45 

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu 
50 55 60 

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65 70 75 80 

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu 
85 90 95 

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile 
100 105 110 

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu 
115 120 125 

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 
130 135 140 

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys 
145 150 155 160 

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu 
165 170 175 

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His 
180 185 190 

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn 
195 200 205 

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr 
210 215 220 

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys 
225 230 235 240 

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala 
245 250 255 

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys 
260 265 270 

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln 
275 280 285 

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu 
290 295 300 

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala 
305 310 315 320 

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala 
325 330 335 

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro 
340 345 350 

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 
355 360 365 

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala 
370 375 380 

Lys Val Val Glu Ala Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg 
385 390 395 400 

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val 
405 410 415 

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser 
420 425 430 

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe 
435 440 445 

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln 
450 455 460 

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val 
465 470 475 480 

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala 
485 490 495 

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val 
500 505 510 

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp 
515 520 525 

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu 
530 535 540 

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys 
545 550 555 560 

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu 
565 570 575 

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr 
580 585 590 

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp 
595 600 605 

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln 
610 615 620 









<210> 154 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 389 GCG 



<400> 154 



Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys 
1 5 10 15 

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala 
20 25 30 

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys 
35 40 45 

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln 
50 55 60 

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu 
65 70 75 80 

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala 
85 90 95 

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala 
100 105 110 

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro 
115 120 125 

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 
130 135 140 

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala 
145 150 155 160 

Lys Val Val Glu Ala Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg 
165 170 175 

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val 
180 185 190 

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser 
195 200 205 

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe 
210 215 220 

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln 
225 230 235 240 

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val 
245 250 255 

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala 
260 265 270 

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val 
275 280 285 

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp 
290 295 300 

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu 
305 310 315 320 

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys 
325 330 335 

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu 
340 345 350 

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr 
355 360 365 

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp 
370 375 380 

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln 
385 390 395 












<210> 155 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 389 GCG 

<400> 155 




























Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp 
1 5 10 15 

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu 
20 25 30 

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile 
35 40 45 

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu 
50 55 60 

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65 70 75 80 

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu 
85 90 95 

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile 
100 105 110 

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu 
115 120 125 

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 
130 135 140 

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys 
145 150 155 160 

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu 
165 170 175 

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His 
180 185 190 

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn 
195 200 205 

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr 
210 215 220 

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys 
225 230 235 240 

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala 
245 250 255 

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys 
260 265 270 

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln 
275 280 285 

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu 
290 295 300 

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala 
305 310 315 320 

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala 
325 330 335 

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro 
340 345 350 

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 
355 360 365 

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala 
370 375 380 

Lys Val Val Glu Ala Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg 
385 390 395 400 

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val 
405 410 415 

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser 
420 425 430 

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe 
435 440 445 

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln 
450 455 460 

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val 
465 470 475 480 

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala 
485 490 495 

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val 
500 505 510 

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp 
515 520 525 

Arg Leu Ala Arg Gly His Ser Leu 
530 535 



















<210> 156 
<211> 312 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 389 GCG 
<400> 156 

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys 
1 5 10 15 

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala 
20 25 30 

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys 
35 40 45 

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln 
50 55 60 

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu 
65 70 75 80 

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala 
85 90 95 

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala 
100 105 110 

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro 
115 120 125 

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 
130 135 140 

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala 
145 150 155 160 

Lys Val Val Glu Ala Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg 
165 170 175 

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val 
180 185 190 

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser 
195 200 205 

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe 
210 215 220 

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln 
225 230 235 240 

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val 
245 250 255 

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala 
260 265 270 

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val 
275 280 285 

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp 
290 295 300 

Arg Leu Ala Arg Gly His Ser Leu 
305 310 

<210> 157 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 407 GCC 

<400> 157 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp 
1 5 10 15 

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu 
20 25 30 

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile 
35 40 45 

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu 
50 55 60 

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65 70 75 80 

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu 
85 90 95 

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile 
100 105 110 

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu 
115 120 125 

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 
130 135 140 

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys 
145 150 155 160 

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu 
165 170 175 

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His 
180 185 190 

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn 
195 200 205 

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr 
210 215 220 

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys 
225 230 235 240 

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala 
245 250 255 

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys 
260 265 270 

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln 
275 280 285 

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu 
290 295 300 

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala 
305 310 315 320 

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala 
325 330 335 

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro 
340 345 350 

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 
355 360 365 

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala 
370 375 380 

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg 
385 390 395 400 

Val Asp Gln Lys Cys Lys Ala Ser Ala Gln Ile Asp Pro Thr Pro Val 
405 410 415 

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser 
420 425 430 

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe 
435 440 445 

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln 
450 455 460 

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val 
465 470 475 480 

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala 
485 490 495 

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val 
500 505 510 

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp 
515 520 525 

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu 
530 535 540 

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys 
545 550 555 560 

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu 
565 570 575 

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr 
580 585 590 

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp 
595 600 605 

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln 
610 615 620 

<210> 158 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 407 GCC 



<400> 158 



Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys 
1 5 10 15 

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala 
20 25 30 

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys 
35 40 45 

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln 
50 55 60 

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu 
65 70 75 80 

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala 
85 90 95 

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala 
100 105 110 

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro 
115 120 125 

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 
130 135 140 

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala 
145 150 155 160 

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg 
165 170 175 

Val Asp Gln Lys Cys Lys Ala Ser Ala Gln Ile Asp Pro Thr Pro Val 
180 185 190 

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser 
195 200 205 

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe 
210 215 220 

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln 
225 230 235 240 

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val 
245 250 255 

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala 
260 265 270 

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val 
275 280 285 

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp 
290 295 300 

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu 
305 310 315 320 

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys 
325 330 335 

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu 
340 345 350 

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr 
355 360 365 

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp 
370 375 380 

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln 
385 390 395 

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<210> 159 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 407 GCC 

<400> 159 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp 
1 5 10 15 

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu 
20 25 30 

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile 
35 40 45 

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu 
50 55 60 

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65 70 75 80 

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu 
85 90 95 

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile 
100 105 110 

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu 
115 120 125 

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 
130 135 140 

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys 
145 150 155 160 

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu 
165 170 175 

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His 
180 185 190 

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn 
195 200 205 

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr 
210 215 220 

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys 
225 230 235 240 

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala 
245 250 255 

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys 
260 265 270 

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln 
275 280 285 

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu 
290 295 300 

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala 
305 310 315 320 

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala 
325 330 335 

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro 
340 345 350 

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 
355 360 365 

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala 
370 375 380 

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg 
385 390 395 400 

Val Asp Gln Lys Cys Lys Ala Ser Ala Gln Ile Asp Pro Thr Pro Val 
405 410 415 

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser 
420 425 430 

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe 
435 440 445 

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln 
450 455 460 

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val 
465 470 475 480 

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala 
485 490 495 

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val 
500 505 510 

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp 
515 520 525 

Arg Leu Ala Arg Gly His Ser Leu 
530 535 

<210> 160 
<211> 312 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 407 GCC 

<400> 160 

Met Glu Leu Val Gly Trp Leu Val 
1 5 

Gln Trp Ile Gln Glu Asp Gln Ala 
20 

Ser Asn Ser Arg Ser Gln Ile Lys 
35 40 

Ile Met Ser Leu Thr Lys Thr Ala 
50 55 

Pro Val Glu Asp Ile Ser Ser Asn 
65 70 

Asn Gly Tyr Asp Pro Gln Tyr Ala 
85 

Thr Lys Lys Phe Gly Lys Arg Asn 
100 

Thr Thr Gly Lys Thr Asn Ile Ala 
115 120 

Phe Tyr Gly Cys Val Asn Trp Thr 
130 135 

Cys Val Asp Lys Met Val Ile Trp 
145 150 

Lys Val Val Glu Ser Ala Lys Ala 
165 

Val Asp Gln Lys Cys Lys Ala Ser 
180 

Ile Val Thr Ser Asn Thr Asn Met 
195 200 

Thr Thr Phe Glu His Gln Gln Pro 
210 215 

Glu Leu Thr Arg Arg Leu Asp His 
225 230 

Glu Val Lys Asp Phe Phe Arg Trp 
245 

Glu His Glu Phe Tyr Val Lys Lys 












Asp Lys 


Gly He 


Thr 


Ser 


Glu 


Lys 


10 








15 




Ser Tyr 


He Ser 


Phe 


Asn 


Ala 


Ala 


25 






30 






Ala Ala 


Leu Asp 


Asn 


Ala 


Gly 


Lys 






45 








Pro Asp 


Tyr Leu 


Val 


Gly 


Gin 


Gin 




60 










Arg He 


Tyr Lys 


He 


Leu 


Glu 


Leu 




75 








80 


Ala Ser 


Val Phe 


Leu 


Gly 


Trp Ala 


90 








95 




Thr He 


Trp Leu 


Phe 


Gly 


Pro 


Ala 


105 






110 






Glu Ala 


He Ala 


His 


Thr 


Val 


Pro 






125 








Asn Glu 


Asn Phe 


Pro 


Phe 


Asn Asp 




140 










Trp Glu 


Glu Gly 


Lys 


Met 


Thr 


Ala 




155 








160 


He Leu Gly Gly 


Ser 


Lys 


Val 


Arg 


170 








175 




Ala Gin He Asp 


Pro 


Thr 


Pro 


Val 


185 






190 






Cys Ala Val He 


Asp 


Gly 


Asn 


Ser 






205 








Leu Gin Asp Arg 


Met 


Phe 


Lys 


Phe 




220 










Asp Phe 


Gly Lys 


Val 


Thr 


Lys 


Gin 




235 








240 


Ala Lys 


Asp His 


Val 


Val 


Glu 


Val 


250 








255 




Gly Gly Ala Lys 


Lys 


Arg 


Pro 


Ala 
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260 

Pro Ser Asp Ala 
275 

Ala Gin Pro Ser 
290 

Arg Leu Ala Arg 
305 



Asp lie Ser Glu 

280 

Thr Ser Asp Ala 
295 

Gly His Ser Leu 
310 



265 

Pro Lys Arg Val 

Glu Ala Ser He 

300 



270 

Arg Glu Ser Val 
285 

Asn Tyr Ala Asp 



<210> 161 
<211> 621 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Mutant rep protein: rep78 411 GCA 



<400> 161 




Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ala Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620





<210> 162 

<211> 397 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 411 GCA



<400> 162 



Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ala Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
385 390 395












<210> 163
<211> 536
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep68 411 GCA

<400> 163


























Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ala Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 164 
<211> 312 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 411 GCA



<400> 164 

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys






1 5 10 15 

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ala Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Leu Ala Arg Gly His Ser Leu
305 310

<210> 165 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 414 GCT 
<400> 165 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Ala Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620

<210> 166 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 414 GCT 
<400> 166 

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Ala Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
385 390 395

<210> 167 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 414 GCT

<400> 167



Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Ala Pro Val
405 410 415

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 168
<211> 312
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep40 414 GCT

<400> 168

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Ala Pro Val
180 185 190

Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Leu Ala Arg Gly His Ser Leu
305 310



<210> 169 

<211> 621 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 420 GCT



<400> 169 



Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300








Asn Gly 


Tyr 


Asp 


Pro 


Gin 


Tyr 


Ala 


Ala Ser 


Val Phe 


Leu Gly Trp Ala 


305 










310 








315 






320 


Thr 


Lys 


Lys 


Phe 


Gly 


Lys 


Arg 


Asn 


Thr He 


Trp Leu 


Phe Gly 


Pro 


Ala 
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325 










330 






335 




Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 


Glu 


Ala 


He Ala 


His 


Thr Val 


Pro 






340 










345 








350 




Phe 


Tyr 


Gly 


Cys 


Val 


Asn 


Trp 


Thr 


Asn 


Glu 


Asn Phe 


Pro 


Phe Asn Asp 












360 








365 






Cys 


Val 


Asp 


Lys 


Met 


Val 


He 


Trp 


Trp 


Glu 


Glu Gly Lys 


Met Thr 


Ala 


370 










375 








380 








Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 


He 


Leu 


Gly Gly Ser 


Lys Val 


Arg 


385 










390 










395 






400 


Val 


Asp 


— « 

Gin 


Lys 


Cys 


Lys 


Ser 


Ser 


Ala 


Gin 


lie Asp 


Pro 


Thr Pro 


Val 


* 






405 










410 






415 




lie 


Val 


Thr 


Ala 


Asn 


Thr 


Asn 


Met 


Cys 


Ala 


Val He 


Asp 


Gly Asn 


Ser 








420 










425 








430 




Thr 


Thr 


Phe 


Glu 


His 


Gin 


Gin 


Pro 


Leu 


Gin 


Asp Arg 


Met 


Phe Lys 


Phe 






435 










440 








445 






Glu 


Leu 


Thr 


Arg Arg Leu Asp 


His 


Asp 


Phe 


Gly Lys 


Val 


Thr Lys 


Gin 




450 










455 








460 








Glu 


Val 


Lys 


Asp 


Phe 


Phe 


Arg 


Trp Ala 


Lys 


Asp His 


Val 


Val Glu 


Val 


465 










470 










475 






480 


Glu 


His 


Glu 


Phe 


Tyr 


Val 


Lys 


Lys 


Gly Gly 


Ala Lys 


Lys 


Arg Pro 


Ala 










485 










490 






495 




Pro 


Ser 


Asp 


Ala 


Asp 


He 


Ser 


Glu 


Pro 


Lys 


Arg Val 


Arg 


Glu Ser 


Val 






500 








505 








510. 




Ala 


Gin 


Pro 


Ser 


Thr 


Ser Asp 


Ala 


Glu 


Ala 


Ser He 


Asn 


Tyr Ala Asp 






515 










520 








525 






Arg 


Tyr 


Gin 


Asn 


Lys 


Cys 


Ser 


Arg 


His 


Val 


Gly Met 


Asn 


Leu Met 


Leu 




530 










535 








540 








Phe 


Pro 


Cys 


Arg 


Gin 


Cys 


Glu 


Arg 


Met 


Asn 


Gin Asn 


Ser 


Asn He 


Cys 


545 








550 










555 






560 


Phe 


Thr 


His 


Gly Gin Lys 


Asp 


Cys 


Leu 


Glu 


Cys Phe 


Pro 


Val Ser 


Glu 










565 










570 






575 




Ser 


Gin 


Pro 


Val 


Ser 


Val 


Val 


Lys 


Lys 


Ala 


Tyr Gin Lys 


Leu Cys 


Tyr 








580 










585 








590 




He 


His 


His 


He Met Gly Lys 


Val 


Pro 


Asp 


Ala Cys 


Thr 


Ala Cys 


Asp 






595 










600 








605 






Leu 


Val 


Asn 


Val 


Asp Leu Asp 


Asp 


Cys 


He 


Phe Glu 


Gin 








610 










615 








620 









<210> 170 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 420 GCT



<400> 170 



Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ala Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
385 390 395






<210> 171
<211> 536
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep68 420 GCT

<400> 171

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ala Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 172
<211> 312
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep40 420 GCT
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<400> 172 


Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ala Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Leu Ala Arg Gly His Ser Leu
305 310



<210> 173 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 421 GCC 
<400> 173 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Ala Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
530 535 540

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
545 550 555 560

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
565 570 575

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
580 585 590

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
595 600 605

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
610 615 620



<210> 174 
<211> 397 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Mutant rep protein: rep52 421 GCC 



<400> 174 




Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Ala Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu
305 310 315 320

Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys
325 330 335

Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu
340 345 350

Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr
355 360 365

Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp
370 375 380

Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln
385 390 395








<210> 175
<211> 536
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep68 421 GCC

<400> 175

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp
1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu
20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile
35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu
50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80

Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu
85 90 95

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile
100 105 110

Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu
115 120 125

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly
130 135 140

Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys
145 150 155 160

Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu
165 170 175

Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His
180 185 190

Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn
195 200 205

Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr
210 215 220

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
225 230 235 240

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
245 250 255

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
260 265 270

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
275 280 285

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
290 295 300

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
305 310 315 320

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
325 330 335

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
340 345 350

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
355 360 365

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
370 375 380

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
385 390 395 400

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
405 410 415

Ile Val Thr Ser Ala Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
420 425 430

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
435 440 445

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
450 455 460

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
465 470 475 480

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
485 490 495

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
500 505 510

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
515 520 525

Arg Leu Ala Arg Gly His Ser Leu
530 535

<210> 176
<211> 312
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep40 421

<400> 176

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys
1 5 10 15

Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala
20 25 30

Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys
35 40 45

Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln
50 55 60

Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu
65 70 75 80

Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala
85 90 95

Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala
100 105 110

Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro
115 120 125

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp
130 135 140

Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala
145 150 155 160

Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg
165 170 175

Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val
180 185 190

Ile Val Thr Ser Ala Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser
195 200 205

Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe
210 215 220

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln
225 230 235 240

Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val
245 250 255

Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala
260 265 270

Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val
275 280 285

Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp
290 295 300

Arg Leu Ala Arg Gly His Ser Leu
305 310



<210> 177 
<211> 621 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Mutant rep protein: rep78 422 GCC 



<400> 177 



Thr 


Ala 


Gly 


Phe 


1 








Glu 


His 


Leu 


Pro 








20 


Lys 


Glu 


Trp 


Glu 






35 




Glu 


Gin 


Ala 


Pro 




50 






Thr 


Glu 


Trp 


Arg 


65 








Gin 


Phe 


Glu 


Lys 


Thr 


Thr 


Gly 


Val 








100 


Arg 


Glu 


Lys 


Leu 






115 




Pro 


Asn 


Trp 


Phe 




130 






Asn 


Lys 


Val 


Val 


145 








Thr 


Gin 


Pro 


Glu 


Ser 


Ala 


Cys 


Leu 








180 


Leu 


Thr 


His 


Val 






195 




Pro 


Asn 


Ser 


Asp 




210 






Met 


Glu 


Leu 


Val 


225 








Gin 


Trp 


He 


Gin 


Ser 


Asn 


Ser 


Arg 








260 


lie 


Met 


Ser 


Leu 






275 




Pro 


Val 


Glu 


Asp 




290 






Asn 


Gly 


Tyr 


Asp 



Tyr 


Glu 


He 


Val 


5 








Gly 


He 


Ser 


Asp 


Leu 


Pro 


Pro 


Asp 








40 


Leu 


Thr 


Val 


Ala 






55 




Arg 


Val 


Ser 


Lys 




70 






Gly 


Glu 


Ser 


Tyr 


85 








Lys 


Ser 


Met 


Val 


He 


Gin 


Arg 


He 








120 


Ala 


Val 


Thr 


Lys 






135 




Asp 


Glu 


Cys 


Tyr 




150 






Leu 


Gin 


Trp 


Ala 


165 








Asn 


Leu 


Thr 


Glu 


Ser 


Gin 


Thr 


Gin 








200 


Ala 


Pro 


Val 


He 






215 




Gly 


Trp 


Leu 


Val 




230 






Glu 


Asp 


Gin 


Ala 


245 








Ser 


Gin 


He 


Lys 


Thr 


Lys 


Thr 


Ala 








280 


lie 


Ser 


Ser 


Asn 






295 




Pro


Gin 


Tyr 


Ala 



He Lys Val Pro 
10 

Ser Phe Val Asn 
25 

Ser Asp Met Asp 

Glu Lys Leu Gin 

60 

Ala Pro Glu Ala 
75 

Phe His Met His 
90 

Leu Gly Arg Phe 
105 

Tyr Arg Gly He 

Thr Arg Asn Gly 

140 

He Pro Asn Tyr 
155 

Trp Thr Asn Met 
170 

Arg Lys Arg Leu 
185 

Glu Gin Asn Lys 

Arg Ser Lys Thr 

220 

Asp Lys Gly He 
235 

Ser Tyr He Ser 
250 

Ala Ala Leu Asp 
265 

Pro Asp Tyr Leu 

Arg He Tyr Lys 

300 

Ala Ser Val Phe 



Ser Asp Leu Asp 
15 

Trp Val Ala Glu 
30 

Leu Asn Leu He 
45 

Arg Asp Phe Leu 

Leu Phe Phe Val 

80 

Val Leu Val Glu 
95 

Leu Ser Gin He 
110 

Glu Pro Thr Leu 
125 

Ala Gly Gly Gly 

Leu Leu Pro Lys 

160 

Glu Gin Tyr Leu 
175 

Val Ala Gin His 
190 

Glu Asn Gin Asn 
205 

Ser Ala Arg Tyr 

Thr Ser Glu Lys 

240 

Phe Asn Ala Ala 
255 

Asn Ala Gly Lys 
270 

Val Gly Gin Gin 
285 

He Leu Glu Leu 
Leu Gly Trp Ala 
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305 310 



Thr


Lys


Lys 


n't, q 

Phe


Gly


Lys Arg 


21 CSTl 

noli 


X Xli. 










1 o c 

325 










mL — 

Thr


Thr


Gly


Lys


Thr


Asn 


Ile


Ala


Glu






340 










O ft D 


Phe 


Tyr 


Gly 




XT -j T 

Val 


Asn Trp 


X iUL 


7\ o n 




355 










*} C f\ 

360 




Cys 


Val 


Asp 


Lys 


Met 


Val 


He 


Trp


Trp


370 








375 






Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 


Ile


385 










390 








Val 


Asp 


Gin 


Lys 


Cys 


Lys 


Ser 


Ser 


Ala 










405 










lie 


Val 


Thr 


Ser 


Asn 


Ala 


Asn 


Met 


Cys 








420 










425 


Thr 


Thr 


Phe 


Glu 


His 


Gin 


Gin 


Pro 


Leu 






435 










440 




Glu 


Leu 


Thr 


Arg 


Arg 


Leu 


Asp 


His 


Asp 




450 










455 






Glu 


Val 


Lys 


Asp 


Phe 


Phe 


Arg 


Trp 


Ala 


465 










470 








Glu 


His 


Glu 


Phe 


Tyr 


Val 


Lys 


Lys 


Gly 










485 










Pro 


Ser 


Asp 


Ala 


Asp 


He 


Ser 


Glu 


Pro 






500 










505 


Ala 


Gin 


Pro 


Ser 


Thr 


Ser Asp 


Ala 


Glu 






515 










520 




Arg 


Tyr 


Gin 


Asn 


Lys 


Cys 


Ser 


Arg 


His 


530 










535 






Phe 


Pro 


Cys 


Arg 


Gin 


Cys 


Glu 


Arg 


Met 


545 










550 








Phe 


Thr 


His 


Gly Gin 


Lys 


Asp 


Cys 


Leu 










565 










Ser 


Gin 


Pro 


Val 


Ser 


Val 


Val 


Lys 


Lys 








580 










585 


He 


His 


His 


He 


Met 


Gly Lys 


Val 


Pro 






595 










600 




Leu 


Val 


Asn 


Val 


Asp 


Leu Asp 


Asp 


Cys 




610 










615 







<210> 178 

<211> 397 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 422 



<400> 178 



Met Glu 


Leu 


Val 


Gly 
5 


Trp 


Leu Val Asp 


1 

Gin Trp 


He 


Gin 


Glu 


Asp 


Gin Ala Ser 




20 






25 


Ser Asn 


Ser 


Arg 


Ser 


Gin 


He Lys Ala 




35 








40 


He Met 


Ser 


Leu 


Thr 


Lys 


Thr Ala Pro 


50 










55 


Pro Val 


Glu 


Asp 


He 


Ser 


Ser Asn Arg 


65 








70 




Asn Gly 


Tyr 


Asp 


Pro 


Gin 


Tyr Ala Ala 








85 
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31 5 










320 


Ile


Trp


Leu Phe Gly Pro Ala 


330 








335 




Aid 


Tl A 

lie 


Ala 


His 


Thr 
350 


Val 


Pro 


Glu


Asn 


Phe 


Pro 


Phe 


Asn Asp 








365 








Glu


Glu


Gly Lys 


Met 


Thr 


Ala 






380 










Leu 


Gly 


Gly Ser Lys 


Val 


Arg 




395 










400 


Gin 


He 


Asp 


Pro 


Thr 


Pro 


Val 


410 








415 




Ala 


Val 


He Asp Gly Asn Ser 










430 






Gin 


Asp 


Arg 


Met 


Phe 


Lys 


Phe 






445 








Phe 


Gly 


Lys 
460 


Val 


Thr 


Lys 


Gin 


Lys 


Asp 


His 


Val 


Val 


Glu 


Val 


475 










480 


Gly 


Ala 


Lys 


Lys 


Arg 


Pro 


Ala 


490 










495 




Lys 


Arg 


Val 


Arg 


Glu 


Ser 


Val 








510 






Ala 


Ser 


He 


Asn Tyr


Ala 


Asp 








525 








Val 


Gly 


Met 


Asn 


Leu 


Met 


Leu 




540 










Asn 


Gin 
555 


Asn 


Ser 


Asn 


He 


Cys 
560 


Glu 


Cys 


Phe 


Pro 


Val 


Ser 


Glu 


570 








575 




Ala 


Tyr 


Gin 


Lys 


Leu 
590 


Cys 


Tyr 


Asp 


Ala 


Cys 


Thr 
605 


Ala 


Cys 


Asp 


He 


Phe 


Glu 


Gin 









620 



GCC 



Lys 


Gly 


He Thr Ser Glu Lys 


10 




15 


Tyr 


He 


Ser Phe Asn Ala Ala 




30 


Ala 


Leu 


Asp Asn Ala Gly Lys 






45 


Asp 


Tyr 


Leu Val Gly Gin Gin 






60 


He 


Tyr 


Lys He Leu Glu Leu 




75 


80 


Ser 


Val 


Phe Leu Gly Trp Ala 


90 




95 
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Thr 




Lys 


Phe 


Gly


Lys Arg Asn Thr 




100 










105 


TVit* 


T'V|-*~ 


Gly Lys 


Thr 


A cj n 
noil 


Tie 

-L J- w 


Ala 


Glu 

w JU \Jk 






115 










120 




rue 


xyx 


Gly Cys 


Val 


Asn 


Trp


Thr 


Asn 




130 










XJ J 






Cys


Val 


Asp 


Lys 


Met 


vctx 


lie 


Trp 


Trp 


1/1 c 

JLft _> 








XD\J 








Lys


Val


Val 


Glu 


Ser 


7\ 1 -a 


Lys 


Ala 


He 








165 










vai 


1\ j^i "ir*i_ 

ASp 


Gin Lys 


Cys 


Lys


Ser 


Ser 


Ala 






180 










185 


Tl _ 

lie 


Val


Thr 


Ser 


Asn 


/vJ.a 


Asn 


Met 


Cys 






195 










200 




rp1_ -v~ 


Thr


Phe 


Glu 


His 


Gin 


Gin 


Pro 


Leu 














215 


His 




Glu 


Leu 


Thr 


Arg 


Arg 


Leu 


Asp 


Asp 


225 










230 








Glu 


Val 


Lys 


Asp 


Phe 


Phe 


Arg 


Trp 


Ala 










245 










Glu 


His 


Glu 
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Phe Pro Cys Arg 
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Phe Thr His Gly 

Ser Gin Pro Val 
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Gin Lys Asp Cys 
565 

Ser Val Val Lys 

Met Gly Lys Val 
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Asp Leu Asp Asp 
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555 

Leu Glu Cys Phe 
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Lys Ala Tyr Gin 
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Pro Asp Ala Cys 

Cys lie Phe Glu 
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Ser Asn lie Cys 
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Pro Val Ser Glu 
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Lys Leu Cys Tyr 
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Thr Ala Cys Asp 
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Gin 
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Val 
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Val 


Asn 


Trp 
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Met 


Val 
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Met 
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Val 
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Val 


Glu 


Ser 


Ala 


Lys 


Ala 


He Leu Gly 


Gly 
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Lys 


Ser 
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Thr 
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180 








185 








190 






He 


Val 


Thr 


Ser 


Asn 
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Phe 


Lys 
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Arg 
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Val 


Thr 
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Val 
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Leu 


Arg 


Tyr 


Gin 


Asn 


Lys 


Cys 


Ser 


Arg 


His Val 


Gly Met 


Asn 


Leu 


Met 


305 






310 








315 








He 


320 


Phe 


Pro 


Cys 


Arg 


Gin 


Cys 


Glu 


Arg 


Met Asn 


Gin 


Asn 


Ser 


Asn 


Cys 
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325 330 335 



Phe 


Thr 


His 


Gly 


Gin 


Lys 


Asp 


Cys 


Leu Glu 


Cys 


Phe Pro Val 


Ser 


Glu 








340 








345 




350 






Ser 


Gin 


Pro 


Val 


Ser 


Val 


Val 


Lys 


Lys Ala Tyr Gin Lys Leu 


Cys 


Tyr 






355 










360 






365 






lie 


His 


His 


He 


Met 


Gly Lys 


Val 


Pro Asp Ala Cys Thr Ala 


Cys 


Asp 




370 










375 








380 






Leu 


Val 


Asn 


Val 


Asp 


Leu Asp Asp 


Cys He 


Phe 


Glu Gin 






385 








390 








395 








<210> 183
<211> 536
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep68 424 GCG

<400> 183






















Jm [11 


Ala 


Gly 


Phe 


Tyr 


Glu 


He 


Val 


He Lys 


Val 


Pro Ser Asp 


Leu 


Asp 








5 








10 






15 




Glu 


His 


Leu 


Pro 


Gly 


He 


Ser 


Asp 


Ser Phe 


Val 


Asn Trp Val 


Ala 


Glu 








20 








25 




30 






Lys 


Glu 


Trp 


Glu 


Leu 


Pro 


Pro 


Asp 


Ser Asp 


Met 


Asp Leu Asn 


Leu 


lie 




35 










40 






45 


Phe 




Glu 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 


Ala 


Glu Lys 


Leu Gin Arg Asp 


Leu 




50 










55 








60 




Val 


Thr 


Glu 


Trp 


Arg 


Arg 


Val 


Ser 


Lys 


Ala Pro 


Glu 


Ala Leu Phe 


Phe 


65 




70 








75 




Val 


80 


Gin 


Phe 


Glu 


Lys 


Gly 


Glu 


Ser 


Tyr 


Phe His 


Met 


His Val Leu 


Glu 








85 








90 






95 


He 


Thr 


Thr 


Gly 


Val 


Lys 


Ser 


Met 


Val 


Leu Gly Arg 


Phe Leu Ser 


Gin 






100 








105 




110 






Arg 


Glu 


Lys 


Leu 


He 


Gin 


Arg 


He 


Tyr Arg Gly 


He Glu Pro 


Thr 


Leu 




115 










120 






125 






Pro 


Asn 


Trp 


Phe 


Ala 


Val 


Thr 


Lys 


Thr Arg Asn Gly Ala Gly 


Gly 


Gly 




130 








135 








140 






Asn 


Lys 


Val 


Val 


Asp 


Glu 


Cys 


Tyr 


He Pro 


Asn 


Tyr Leu Leu 


Pro 


Lys 


145 






150 








155 






160 


Thr 


Gin 


Pro 


Glu 


Leu 


Gin 


Trp 


Ala 


Trp Thr Asn 


Met Glu Gin 


Tyr 


Leu 










165 






170 






175 


His 


Ser 


Ala 


Cys 


Leu 


Asn 


Leu 


Thr 


Glu 


Arg Lys Arg 


Leu Val Ala 


Gin 






180 










185 




190 






Leu 


Thr 


His 


Val 


Ser 


Gin 


Thr 


Gin 


Glu Gin 


Asn 


Lys Glu Asn 


Gin 


Asn 






195 










200 






205 






Pro 


Asn 


Ser 


Asp 


Ala 


Pro 


Val 


He 


Arg Ser 


Lys 


Thr Ser Ala 


Arg 


Tyr 




210 








215 








220 


Glu 




Met 


Glu 


Leu 


Val 


Gly 


Trp 


Leu 


Val 


Asp Lys 


Gly 


He Thr Ser 


Lys 


225 








230 








235 




Ala 


240 


Gin 


Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ala 


Ser Tyr 


He 


Ser Phe Asn 


Ala 








245 






250 






255 




Ser 


Asn 


Ser 


Arg 


Ser 


Gin 


He 


Lys 


Ala Ala 


Leu 


Asp Asn Ala 


Gly 


Lys 








260 








265 




270 




Gin 


lie 


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 


Pro Asp Tyr Leu Val Gly 


Gin 






275 








280 






285 






Pro 


Val 


Glu 


Asp 


He 


Ser 


Ser 


Asn 


Arg He 


Tyr 


Lys He Leu 


Glu 


Leu 




290 








295 








300 






Asn 


Gly 


Tyr 


Asp 


Pro 


Gin 


Tyr 


Ala 


Ala Ser 


Val 


Phe Leu Gly 


Trp 


Ala 


305 




310 








315 






320 


Thr 


Lys 


Lys 


Phe 


Gly 


Lys 


Arg 


Asn 


Thr He 


Trp 


Leu Phe Gly 


Pro 


Ala 






325 






330 






335 
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Glv 


Lys 


Thr 


Asn 


lie 


Ala 


Glu 






340 










345 






Gly


Cys 


Val 


Asn 


Trp 


Thr 


Asn 






355 










360 




Cys 


Val 


Asp 


Lys 


Met 


Val 


lie 


Trp 


Trp 


370 










375 






Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 


He 


385 










390 








Val 


Asp 


Gin 


Lys 


Cys 


Lys 


Ser 


Ser 


Ala 








405 










He 


Val 


Thr 


Ser 


Asn 


Thr 


Asn 


Ala 


Cys 








420 










425 


Thr 


Thr 


Phe 


Glu 


His 


Gin 


Gin 


Pro 


Leu 






435 










440 




Glu 


Leu 


Thr 


Arg 


Arg 


Leu 


Asp 


His 


Asp 




450 










455 






Glu 


Val 


Lys 


Asp 


Phe 


Phe 


Arg 


Trp 


Ala 


465 










470 








Glu 


His 


Glu 


Phe 


Tyr 


Val 


Lys 


Lys 


Gly 










485 










Pro 


Ser 


Asp 


Ala Asp 


lie 


Ser 


Glu 


Pro 






500 










505 


Ala 


Gin 


Pro 


Ser 


Thr 


Ser 


Asp 


Ala 


Glu 






515 










520 




Arg 


Leu 


Ala 


Arg Gly His 


Ser 


Leu 





530 535 

<210> 184
<211> 312 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 424 



<400> 184 



Met 


Glu 


Leu 


Val 


Gly 
5 


Trp 


Leu 


Val 


Asp 


1 
Gin 


Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ala 


Ser 






20 










25 


Ser 


Asn 


Ser 


Arg 


Ser 


Gin 


He 


Lys 


Ala 






35 










40 




He 


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 


Pro 




50 










55 






Pro 


Val 


Glu 


Asp 


He 


Ser 


Ser 


Asn 


Arg 


65 










70 








Asn 


Gly 


Tyr 


Asp 


Pro 


Gin 


Tyr 


Ala 


Ala 








85 










Thr 


Lys 


Lys 


Phe 


Gly 


Lys 


Arg 


Asn 


Thr 








100 










105 


Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 


Glu 






115 










120 




Phe 


Tyr 


Gly 


Cys 


Val 


Asn 


Trp 


Thr 


Asn 




130 










135 






Cys 


Val 


Asp 


Lys 


Met 


Val 


He 


Trp 


Trp 


145 










150 








Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 


He 








165 










Val 


Asp 


Gin 


Lys 


Cys 


Lys 


Ser 


Ser 


Ala 






180 










185 


He 


Val 


Thr 


Ser 


Asn 


Thr 


Asn 


Ala 


Cys 



PCT/IB02/04087






Ala 


He 


Ala His Thr Val 


Pro 






350 




Glu 


Asn 


Phe Pro Phe Asn Asp 






365 




Glu 


Glu 


Gly Lys Met Thr 


Ala 






380 




Leu 


Gly 


Gly Ser Lys Val 


Arg 




395 




400 


Gin 


He 


Asp Pro Thr Pro 


Val 


410 




415 




Ala 


Val 


He Asp Gly Asn 


Ser 






430 




Gin 


Asp 


Arg Met Phe Lys 


Phe 






445 




Phe 


Gly 


Lys Val Thr Lys 


Gin 






460 




u y •» 




His Val Val Glu 


Val 


475 




480 


\jJLy 




Lys Lys Arg Pro 


Ala 


4 90 
*i \j 




495 








Val Arg Glu Ser 


Val 




510 




nlcl 


O Ci. 


He Asn Tyr Ala Asp 






525 




GCG 








Lys


Gly


Ile Thr Ser Glu


Lys 


10 


15 






He 


Ser Phe Asn Ala 


Ala 




30 




Ala 


Leu 


Asp Asn Ala Gly Lys 






45 




Asp 


Tyr 


Leu Val Gly Gin Gin 






60 




He 


Tyr 


Lys He Leu Glu 


Leu 




75 




80 


Ser 


Val 


Phe Leu Gly Trp 


Ala 


90 




95 




He 


Trp 


Leu Phe Gly Pro 


Ala 




110 




Ala 


He 


Ala His Thr Val 


Pro 






125 




Glu 


Asn 


Phe Pro Phe Asn 


Asp 






140 




Glu 


Glu 


Gly Lys Met Thr 


Ala 




155 


160 


Leu 


Gly 


Gly Ser Lys Val 


Arg 


170 




175 




Gin 


He 


Asp Pro Thr Pro 


Val 






190 




Ala 


Val 


He Asp Gly Asn 


Ser 
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195 










200 


Thr 


Thr 


Phe 


Glu 


His 


Gin 


Gin 


Pro 




210 










215 




Glu 


Leu 


Thr 


Arg 


Arg 


Leu Asp 


His 


225 










230 






Glu 


Val 


Lys 


Asp 


Phe 


Phe 


Arg 


Trp 










245 








Glu 


His 


Glu 


Phe 


Tyr 


Val 


Lys 


Lys 








260 










Pro 


Ser 


Asp Ala Asp 


He 


Ser 


Glu 






275 










280 


Ala 


Gin 


Pro 


Ser 


Thr 


Ser Asp 


Ala 




290 










295 




Arg 


Leu 


Ala 


Arg 


Gly 


His 


Ser 


Leu 


305 










310 






<210> 185
<211> 621
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep78

<400> 185












Thr Ala Gly 


Phe 


Tyr 


Glu 


He 


Val 


1 








5 








Glu 


His 


Leu 


Pro 


Gly 


He 


Ser 


Asp 








20 










Lys 


Glu 


Trp 


Glu 


Leu 


Pro 


Pro 


Asp 




35 










40 


Glu 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 


Ala 




50 










55 




Thr 


Glu 


Trp 


Arg 


Arg 


Val 


Ser 


Lys 


65 










70 






Gin 


Phe 


Glu 


Lys 


Gly 


Glu 


Ser 


Tyr 








85 








Thr 


Thr 


Gly 


Val


Lys 


Ser 


Met 


Val 








100 










Arg 


Glu 


Lys 


Leu 


He 


Gin 


Arg 


He 






115 










120 


Pro 


Asn 


Trp 


Phe 


Ala 


Val 


Thr 


Lys 




130 










135 




Asn 


Lys 


Val 


Val 


Asp 


Glu 


Cys 


Tyr 


145 










150 






Thr 


Gin 


Pro 


Glu 


Leu 


Gin 


Trp 


Ala 










165 






Ser 


Ala 


Cys 


Leu 


Asn 


Leu 


Thr 


Glu 






180 










Leu 


Thr 


His 


Val 


Ser 


Gin 


Thr 


Gin 






195 










200 


Pro 


Asn 


Ser 


Asp 


Ala 


Pro 


Val 


He 




210 








215 




Met 


Glu 


Leu 


Val 


Gly 


Trp 


Leu 


Val 


225 










230 






Gin 


Trp 


lie 


Gin 


Glu 


Asp 


Gin 


Ala 








245 








Ser 


Asn 


Ser 


Arg 


Ser 


Gin 


He 


Lys 








260 










He 


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 






275 








280 





205 








Leu Gin Asp Arg 


Met 


Phe 


Lys 


Phe 


220 










Asp Phe Gly Lys 


Val 


Thr 


Lys 


Gin 


235 








240 


Ala Lys Asp His 


Val 


Val 


Glu 


Val 


250 






255 




Gly Gly Ala Lys 


Lys 


Arg 


Pro 


Ala 


265 




270 






Pro Lys Arg Val 


Arg 


Glu 


Ser 


Val 




285 








Glu Ala Ser He 


Asn 


Tyr 


Ala 


Asp 



300 



428 GCT



He Lys Val 


Pro 


Ser 


Asp 


Leu 


Asp 


10 








15 




Ser Phe Val 


Asn 


Trp 


Val 


Ala 


Glu 


25 






30 






Ser Asp Met 


Asp 


Leu 


Asn 


Leu 


He 




45 








Glu Lys Leu 


Gin 


Arg 


Asp 


Phe 


Leu 


60 










Ala Pro Glu 


Ala 


Leu 


Phe 


Phe 


Val 


75 










80 


Phe His Met 


His 


Val 


Leu 


Val 


Glu 


90 








95 




Leu Gly Arg 


Phe 


Leu 


Ser 


Gin 


He 


105 






110 






Tyr Arg Gly He 


Glu 


Pro 


Thr 


Leu 






125 








Thr Arg Asn Gly 


Ala 


Gly 


Gly 


Gly 




140 










He Pro Asn Tyr 


Leu 


Leu 


Pro 


Lys 


155 










160 


Trp Thr Asn 


Met 


Glu 


Gin 


Tyr 


Leu 


170 








175 




Arg Lys Arg 


Leu 


Val 


Ala 


Gin 


His 


185 






190 






Glu Gin Asn Lys 


Glu 


Asn 


Gin 


Asn 






205 








Arg Ser Lys 


Thr 


Ser 


Ala 


Arg 


Tyr 


220 










Asp Lys Gly 


He 


Thr 


Ser 


Glu 


Lys 


235 










240 


Ser Tyr He 


Ser 


Phe 


Asn 


Ala 


Ala 


250 








255 




Ala Ala Leu Asp 


Asn 


Ala 


Gly 


Lys 


265 






270 






Pro Asp Tyr 


Leu 


Val 


Gly 


Gin 


Gin 




285 
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fro 


vax 


uXU 


Asp 


lie 




Asn Arg 


Tic, 


Tyr Lys He 


Leu 


Glu 






*y Qft 






<£* J Zj 








300 








Asn 


Gly


Tyr 


Asp 


Pro 




Ala 


Ala 


Cay* 


Val Phe Leu Gly Trp 




one 




Jiv 








315 






320 


Thr 


Lys 


Lys 


Phe 


Gly 


Lys Arg


Asn 


Thr 


Ile


Trp Leu Phe Gly 


Pro 


7\ 1 -J 

Hid 






325 








ft 

J J u 






335 




Thr 


Thr 


Gly 


Lys 


Thr 


Asn Ile


Ala 


Glu 




He Ala His 


Thr 


Val 


Pro






340 








345 






350 






Phe 


Tyr 


Gly 


Cys 


Val 


Asn Trp 


Thr 


Asn 


bill 


Asn Phe Pro 


Phe 


Asn 


7V 

ASp 




t r r 

355 








360 






365 








Cys 


Val 


Asp 


Lys 


Met 


Val lie 


Trp 


Trp 


Glu 


Glu Gly Lys 


Met 


Thr 


Ala


370 




375 








380 








Lys 


Val 


Val 


Glu 


Ser 


Ala Lys 


Ala 


He 


T 

Leu 


Gly Gly Ser 


Lys 


Val 


Arg 


385 










390 








395 






A A ft 
•1 tl U 


Val 


Asp 


Gin 


Lys 


Cys 


Lys Ser 


Ser 


Ala 


Gin 


He Asp Pro 


Thr 


Pro 


Val






405 








410 






415 




lie 


Val 


Thr 


Ser 


Asn 


Thr Asn 


Met 


Cys 


Ala 


Val Ala Asp Gly Asn 


Ser








420 








425 






430 






Thr 


Thr 


Phe 


Glu 


His 


Gin Gin 


Pro 


Leu 


Gin 


Asp Arg Met 


Phe 


Lys 


Phe






435 








440 






445 








Glu 


Leu 


Thr 


Arg Arg 


Leu Asp 


His 


Asp 


Phe 


Gly Lys Val 


Thr 


Lys 


Gin 




450 








455 








460 








Glu 


Val 


Lys 


Asp 


Phe 


Phe Arg 


Trp Ala 


Lys 


Asp His Val 


Val 


Glu 


Val


465 








470 








475 






A Q ft 


Glu 


His 


Glu 


Phe 


Tyr 


Val Lys 


Lys 


Gly 


Gly


Ala Lys Lys 


Arg 


Pro 


Ala










485 








490 






495 




Pro 


Ser 


Asp 


Ala 


Asp 


lie Ser 


Glu 


Pro 


Lys 


Arg Val Arg 


Glu 


Ser 


Val 






500 








505 






510 






Ala 


Gin 


Pro 


Ser 


Thr 


Ser Asp


Ala 


Glu 


Ala 


Ser He Asn Tyr Ala 


Asp 






515 






520 






525 








Arg 


Tyr 


Gin 


Asn 


Lys 


Cys Ser 


Arg 


His 


Val


Gly Met Asn 


Leu 


Met 


iicU 




530 








535 








540 








Phe 


Pro 


Cys 


Arg 


Gin 


Cys Glu 


Arg 


Met 


Asn 


Gin Asn Ser 


Asn 


He 




545 






550 








555 






560


Phe 


Thr 


His 


Gly 


Gin 


Lys Asp 


Cys 


Leu 


Glu


Cys Phe Pro 


Val 


Ser 


r»l 








565 














575 




Ser 


Gin 


Pro 


Val 


Ser 


Val Val 


Lys 


Lys 


Ala


Tyr Gin Lys 


Leu 


Cys 










580 








585 






590 






lie 


His 


His 


lie 


Met 


Gly Lys 


Val 


Pro 


Asp 


Ala Cys Thr 


Ala 


Cys 


Asp 






595 








600 






605 








Leu 


Val 


Asn 


Val 


Asp 


Leu Asp 


Asp 


Cys 


He 


Phe Glu Gin 










610 








615 








620 








<210> 186
<211> 397
<212> PRT

<213> Artificial Sequence
<220>

<223> Mutant rep protein: rep52 428 GCT

<400> 186






















Met 


Glu 


Leu 


Val 


Gly Trp Leu 


Val 


Asp 


Lys 


Gly He Thr 


Ser 


Glu 


Lys 


1 








5 








10 






15 




Gin 


Trp 


lie 


Gin 


Glu 


Asp Gin 


Ala 


Ser 


Tyr 


He Ser Phe 


Asn 


Ala 


Ala 






20 








25 






30 






Ser Asn Ser Arg 


Ser 


Gin lie 


Lys 


Ala 


Ala 


Leu Asp Asn Ala Gly 


Lys 






35 








40 






45 








lie 


Met 


Ser 


Leu 


Thr 


Lys Thr 


Ala 


Pro 


Asp 


Tyr Leu Val 


Gly Gin 


Gin 




50 








55 








60 








Pro 


Val 


Glu Asp 


lie 


Ser Ser 


Asn 


Arg 


He 


Tyr Lys He 


Leu 


Glu 


Leu 
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65 








70 








75 








80 


Asn Gly 


Tyr Asp 


Pro Gin Tyr 


Ala 


Ala 


Ser 


Val Phe 


Leu 




Trp 


Ala 






85 








90 








95 




X 111 




TjVS Phe 


Gly Lys Arg 


Asn 


Thr 


lie 


Trp Leu 


Phe 


Gly 


Pro 


Ala 




100 








105 








110 






TViY" 

X 1 1,1, 


X XXX 


Gly Lys 
115 


Thr 


Asn lie 


Ala 
120 


Glu 


Ala 


lie Ala 


His 
125 


Thr 


Val 


Pro 


rile 


Ayr 


Gly Cys 


Val 


Asn Trp 


Thr 


Asn 


Glu 


Asn Phe 


Pro 


Phe 


Asn Asp 




X _> \J 




135 








140 










Cys


Val 


Asp Lys 


Met 


Val lie 


Trp 


Trp 


Glu 


Glu Gly Lys 


Met 


x 1 1 x 




JL *± 






150 








155 








160 

•X \-J w 


Lys


Val


Val Glu 


Ser 


Ala Lys 


Ala 


lie 


Leu 


Gly Gly Ser 


Lys


Val








165 








170 








175 




val 


TV <■» 
ASp 


Gin Lys 


Cys 


Lys Ser 


Ser 


Ala 


Gin 


lie Asp 


Pro 


Thr 


Pro 


Val 




180 






185 








190 








vajt 


Thr Ser 


Asn 


Thr Asn 


Met 


Cys Ala 


Val Ala 


Asp 


Gly


Asn 


Ser 






195 






200 








205 








1x1x7 


i nr 
210 


Phe Glu 


His 


Gin Gin 
215 


Pro 


Leu 


Gin 


Asp Arg 
220 


Met 


Phe 


Lys 


Phe 


Glu 


Leu 


Thr Arg 


Arg 


Leu Asp 


His 


Asp 


Phe 


Gly Lys 


Val 


Thr 


Lys 


Gin 


225 




230 








235 








240 


Glu 


Val 


Lys Asp 


Phe 


Phe Arg 


Trp Ala 


Lys 


Asp His 


Val 


Val 


Glu 


Val 






245 








250 








255 




Glu 


His 


Glu Phe 


Tyr Val Lys 


Lys 


Gly Gly 


Ala Lys 


Lys 


Arg 


Pro 


Ala 






260 








265 








270 






Pro 


Ser 


Asp Ala Asp 


lie Ser 


Glu 


Pro 


Lys 


Arg Val 


Arg 


Glu 


Ser 


Val 






275 






280 








285 








Ala 


Gin 
290 


Pro Ser 


Thr 


Ser Asp 
295 


Ala 


Glu 


Ala 


Ser lie 
300 


Asn 


Tyr 


Ala 


Asp 


Arg 


Tyr 


Gin Asn 


Lys 


Cys Ser 


Arg His Val 


Gly Met 


Asn 


Leu 


Met 


Leu 


305 






310 








315 








320 


Phe 


Pro 


Cys Arg 


Gin 


Cys Glu 


Arg 


Met 


Asn 


Gin Asn 


Ser 


Asn 


He 


Cys 






325 






330 








335 




Phe 


Thr 


His Gly 


Gin Lys Asp 


Cys 


Leu 


Glu 


Cys Phe 


Pro 


Val 


Ser 


Glu 






340 








345 








350 






Ser 


Gin 


Pro Val 


Ser 


Val Val 


Lys 


Lys 


Ala 


Tyr Gin Lys 


Leu 


Cys 


Tyr 






355 






360 






365 








lie 


His 


His lie 


Met Gly Lys 


Val 


Pro 


Asp 


Ala Cys 


Thr 


Ala 


Cys 


Asp 




370 






375 








380 










Leu 


Val 


Asn Val 


Asp 


Leu Asp 


Asp 


Cys 


lie 


Phe Glu 


Gin 








385 








390 








395 











<210> 187 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep68 428 GCT 
<400> 187 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp

1 5 10 15

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu

20 25 30

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile

35 40 45

Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu

50 55 60

Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val
65 70 75 80
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Gin 


Phe


Glu 


Lys 


Gly Glu 


Ser 


Tyr Phe His 


Met His Val 


Leu 


Val 


Glu 








85 






90 






95 




Thr 


Thr 


Gly 


Val 


Lys 


Ser 


Met 


Val Leu Gly Arg Phe Leu 


Ser 


Gin 


He 






100 






105 




110 






Arg 


Glu 


Lys 


Leu 


He 


Gin Arg 


He Tyr Arg Gly He Glu 


Pro 


Thr 


Leu 




tic 

lib 










120 


125 








Pro 


Asn 


Trp 


Phe 


Ala 


Val 


Thr 


Lys Thr Arg Asn Gly Ala 


Gly Gly Gly 




130 








135 




140 








Asn 


Lys 


Val 


Val 


Asp 


Glu 


Cys 


Tyr He Pro 


Asn Tyr Leu 


Leu 


Pro 


Lys 


145 






150 






155 






160 


Thr 


Gin 


Pro 


Glu 


Leu Gin Trp Ala Trp Thr Asn Met Glu 


Gin 


Tyr 


Leu 










165 






170 






175 




Ser 


Ala 


Cys 


Leu 


Asn 


Leu 


Thr Glu Arg Lys 


Arg Leu Val 


Ala 


Gin 


His 






180 








185 




190 






Leu 


Thr 


His 


Val 


Ser 


Gin 


Thr 


Gin Glu Gin 


Asn Lys Glu 


Asn 


Gin 


Asn 






195 










200 


205 








Pro 


Asn 


Ser 


Asp 


Ala 


Pro 


Val 


He Arg Ser 


Lys Thr Ser 


Ala Arg Tyr 




210 








215 




220 








Met 


Glu 


Leu 


Val 


Gly Trp 


Leu 


Val Asp Lys 


Gly He Thr 


Ser 


Glu 


Lys 


225 










230 






235 






240 


Gin 


Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ala Ser Tyr 


He Ser Phe 


Asn 


Ala 


Ala 








245 






250 






255 




Ser 


Asn 


Ser 


Arg 


Ser 


Gin 


He 


Lys Ala Ala 


Leu Asp Asn 


Ala 


Gly Lys 








260 








265 




270 






He 


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala Pro Asp 


Tyr Leu Val 


Gly Gin Gin 






275 








280 


285 








Pro 


Val 


Glu 


Asp 


He 


Ser 


Ser 


Asn Arg He 


Tyr Lys He 


Leu 


Glu 


Leu 




290 








295 




300 








Asn 


Gly 


Tyr 


Asp 


Pro 


Gin Tyr Ala Ala Ser 


Val Phe Leu 


Gly Trp Ala 


305 




310 






315 






320 


Thr 


Lys 


Lys 


Phe 


Gly Lys 


Arg 


Asn Thr He 


Trp Leu Phe 


Gly 


Pro 


Ala 






325 






330 






335 




Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala Glu Ala 


He Ala His 


Thr 


Val 


Pro 






340 








345 




350 






Phe 


Tyr 


Gly 


Cys 


Val 


Asn 


Trp 


Thr Asn Glu 


Asn Phe Pro 


Phe 


Asn Asp 




*a c c 










360 


365 








Cys 


Val 


Asp 


Lys 


Met 


Val 


lie Trp Trp Glu Glu Gly Lys 


Met 


Thr 


Ala 


370 










375 




380 








Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala He Leu Gly Gly Ser 


Lys 


Val 


Arg 


385 










390 






395 






400 


Val 


Asp 


Gin 


Lys 


Cys 


Lys 


Ser 


Ser Ala Gin 


He Asp Pro 


Thr 


Pro 


Val 








405 






410 






415 




He 


Val 


Thr 


Ser 


Asn Thr Asn Met Cys Ala Val Ala Asp 


Gly Asn Ser 








420 








425 




430 






Thr 


Thr 


Phe 


Glu 


His 


Gin 


Gin 


Pro Leu Gin Asp Arg Met 


Phe 


Lys 


Phe 






435 










440 


445 








Glu 


Leu 


Thr 


Arg 


Arg 


Leu 


Asp 


His Asp Phe 


Gly Lys Val 


Thr 


Lys 


Gin 




450 










455 




460 








Glu 


Val 


Lys 


Asp 


Phe 


Phe Arg Trp Ala Lys Asp His Val 


Val 


Glu 


Val 


465 










470 






475 






480 


Glu 


His 


Glu 


Phe 


Tyr Val 


Lys 


Lys Gly Gly Ala Lys Lys 


Arg 


Pro 


Ala 










485 




490 






495 




Pro 


Ser 


Asp 


Ala 


Asp 


He 


Ser Glu Pro Lys Arg Val Arg 


Glu 


Ser 


Val 








500 








505 




510 






Ala 


Gin 


Pro 


Ser 


Thr 


Ser 


Asp 


Ala Glu Ala 


Ser He Asn 


Tyr Ala Asp 






515 








520 


525 








Arg 


Leu 


Ala 


Arg 


Gly His 


Ser 


Leu 










530 










535 













<210> 188 
<211> 312 
<212> PRT

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep40 428 GCT 



<400> 188 



Met 


Glu 


Leu Val 


Gly 


Trp 


Leu 


Val 


Asp 


Lys 


Gly Ile


Thr 


Ser Glu Lys 


1 






5 










10 






15 


Gln


Trp 


Ile Gln


Glu 


Asp 


Gln


Ala 


Ser 


Tyr 


Ile Ser


Phe 


Asn Ala Ala 




20 










25 








30 


Ser 


Asn 


Ser Arg 


Ser 


Gln


Ile


Lys 


Ala 


Ala 


Leu Asp 


Asn 


Ala Gly Lys 






35 








40 








45 




Ile


Met 


Ser Leu 


Thr 


Lys 


Thr 


Ala 


Pro 


Asp 


Tyr Leu 


Val 


Gly Gln Gln




50 








55 








60 






Pro 


Val 


Glu Asp 


Ile


Ser 


Ser 


Asn Arg 


Ile


Tyr Lys


Ile


Leu Glu Leu 


65 






70 










75 




80 


Asn 


Gly 


Tyr Asp 


Pro 


Gln


Tyr 


Ala 


Ala 


Ser 


Val Phe 


Leu 


Gly Trp Ala 




85 










90 






95 


Thr 


Lys 


Lys Phe 


Gly 


Lys 


Arg 


Asn 


Thr 


Ile


Trp Leu 


Phe 


Gly Pro Ala 




100 










105 








110 


Thr 


Thr 


Gly Lys 


Thr 


Asn 


Ile


Ala 


Glu 


Ala 


Ile Ala


His 


Thr Val Pro 






115 








120 








125 




Phe 


Tyr 


Gly Cys 


Val 


Asn 


Trp 


Thr 


Asn 


Glu 


Asn Phe 


Pro 


Phe Asn Asp 




130 






135 








140 






Cys 


Val 


Asp Lys 


Met 


Val 


Ile


Trp 


Trp 


Glu Glu Gly Lys 


Met Thr Ala 


145 








150 










155 




160 


Lys 


Val 


Val Glu 


Ser 


Ala 


Lys 


Ala 


Ile


Leu Gly Gly 


Ser 


Lys Val Arg 






165 










170 






175 


Val 


Asp 


Gln Lys


Cys 


Lys 


Ser 


Ser 


Ala 


Gln


Ile Asp


Pro 


Thr Pro Val 




180 










185 








190 


Ile


Val 


Thr Ser 


Asn 


Thr 


Asn 


Met 


Cys 


Ala 


Val Ala 


Asp 


Gly Asn Ser 






195 








200 








205 




Thr 


Thr 


Phe Glu 


His 


Gln


Gln


Pro 


Leu 


Gln Asp Arg


Met 


Phe Lys Phe 




210 








215 








220 






Glu 


Leu 


Thr Arg Arg 


Leu 


Asp 


His 


Asp 


Phe 


Gly Lys 


Val 


Thr Lys Gln


225 








230 










235 




240 


Glu 


Val 


Lys Asp 


Phe 


Phe 


Arg 


Trp Ala 


Lys Asp His Val 


Val Glu Val 








245 










250 






255 


Glu 


His 


Glu Phe 


Tyr 


Val 


Lys 


Lys 


Gly 


Gly Ala Lys 


Lys 


Arg Pro Ala 






260 








265 








270 


Pro 


Ser 


Asp Ala 


Asp 


Ile


Ser 


Glu 


Pro 


Lys Arg Val Arg 


Glu Ser Val 






275 






280 








285 




Ala 


Gln


Pro Ser 


Thr 


Ser 


Asp 


Ala 


Glu 


Ala 


Ser Ile


Asn 


Tyr Ala Asp 




290 








295 








300 






Arg 


Leu 


Ala Arg Gly 


His 


Ser 


Leu 












305 








310 

















<210> 189 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep78 429 GCC 
<400> 189 

Thr Ala Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp

1 5 10 15 

Glu His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu

20 25 30
XJjr S3


Glu 

VJJ. u 


Ttti 

X J- 


Glu 






35 




VJX Li. 


m n 

VJ _L 11 


AT a 


XT X. u 










TT-it- 
J.IUu 




±rp 


«xy 


65 








Gln


Phe 


Glu 


Lys 


Thr


Thr


Gly 


Val 








100


Arg 


Glu 


Lys 


T nm* 

Leu 






115 




Pro 


Asn 


Trp 


Phe




130 






Asn 


Lys 


IT- T 

Val 


Val 


145 








Thr 


Gln


Pro 


Glu 


Ser 


Ala 


Cys 


Leu 






loll 


Leu 


Thr 


His


Val






± z> _> 




Pro 


Asn 


Ser 


Asp 




210 






Met 


Glu 


Leu 


Val


225 








Gln


Trp


Ile


Gln


Ser 


Asn 


Ser


Arg 








n r r\ 


Ile


Met


Ser


Leu 






07c 




Pro 


Val 


nl,. 






290 






Asn Gly 


Tyr




305 








Thr Lys 


Lys 


Phe 


Thr 


Thr 


Gly


Lys








"3 >1 A 


Phe 


Tyr 


Gly 


Cys 






355 




Cys 


Val 


Asp 


Lys 




370 






Lys 


Val 


Val 


Glu 


385 








Val 


Asp 


Gln


Lys 


Ile


Val 


Thr 


Ser 








420 


Thr 


Thr 


Phe 


Glu 






435 




Glu 


Leu 




Arg 




450 






Glu 


Val 


Lys 


Asp 


465 








Glu 


His 


Glu 


Phe 


Pro 


Ser 


Asp 


Ala 








500 


Ala 


Gln


Pro 


Ser 



Leu Pro Pro Asp 

40 

Leu Thr Val Ala 
55

Arg Val Ser Lys 
70 

Gly Glu Ser Tyr 
85 

Lys Ser Met Val 

Ile Gln Arg Ile

120 

Ala Val Thr Lys 
135 

Asp Glu Cys Tyr 
150 

Leu Gln Trp Ala
165 

Asn Leu Thr Glu 

Ser Gln Thr Gln

200 

Ala Pro Val Ile
215 

Gly Trp Leu Val 
230 

Glu Asp Gln Ala
245

Ser Gln Ile Lys

Thr Lys Thr Ala 

280 

Ile Ser Ser Asn
295

Pro Gln Tyr Ala
310 

Gly Lys Arg Asn 
325 

Thr Asn Ile Ala

Val Asn Trp Thr 

360 

Met Val Ile Trp
375 

Ser Ala Lys Ala 
390 

Cys Lys Ser Ser 
405 

Asn Thr Asn Met 

His Gln Gln Pro

440 

Arg Leu Asp His 
455 

Phe Phe Arg Trp 
470 

Tyr Val Lys Lys 
485 

Asp Ile Ser Glu
Thr Ser Asp Ala 
Ser


Asp 


Met 


Asp 


Glu 


Lys


Leu 


Gln








60 


Ala 


Pro 


Glu 


Ala 






75 




C Li C 


-El -L O 


lie t«. 






90 










Arg 


Phe 


105








Tyr 


Arg 


Gly 


Ile


Thr


Arg 


Asn 


Gly 








140 


Ile


Pro


Asn 


Tyr 






155 




Trp 


Thr


Asn 


Met 




170






Arg 


Lys


Arg 


Leu 


185 








Glu 


Gln


Asn 


Lys 


Arg 


Ser 


Lys 


Thr 








220 


Asp


Lys


Gly


Ile






235 




Ser


Tyr


Ile


Ser 










nJLa 




Leu Asp 


265 








Pro 


Asp 


Tyr 


Leu 


2\ T*r*t 


Tl^ 


Tyr 


Lys 








300 


nl d 


O -v* 
OCX 


Val 


Phe 






315 




JL LIZ. 


Ile


Trp 


Leu 




330 






m 11 


rixd 


Ile


Ala 


J 4 b 








Asn 


Glu 


Asn 


Phe 


Trp 


Glu 


Glu Gly 








380 


Ile


Leu 


Gly Gly 






395 




Ala 


Gln


Ile


Asp 




410 






Cys 


Ala 


Val 


Ile


425 








Leu 


Gln


Asp 


Arg 


Asp 


Phe 


Gly Lys 








460 


Ala 


Lys 


Asp 


His 






475 




Gly 


Gly 


Ala 


Lys 




490 






Pro 


Lys 


Arg 


Val 


505 








Glu 


Ala 


Ser 


Ile



Leu Asn Leu Ile
45 

Arg Asp Phe Leu 

Leu Phe Phe Val 

80 

Val Leu Val Glu 
95 

Leu Ser Gln Ile
110 

Glu Pro Thr Leu 
125 

Ala Gly Gly Gly 

Leu Leu Pro Lys 

160 

Glu Gln Tyr Leu
175

Val Ala Gln His
190

Glu Asn Gln Asn
205 

Ser Ala Arg Tyr 

Thr Ser Glu Lys 

240 

Phe Asn Ala Ala 
255 

Asn Ala Gly Lys 
270 

Val Gly Gln Gln
285

Ile Leu Glu Leu

Leu Gly Trp Ala 

320 

Phe Gly Pro Ala 
335 

His Thr Val Pro 
350 

Pro Phe Asn Asp 
365 

Lys Met Thr Ala 

Ser Lys Val Arg 

400 

Pro Thr Pro Val 
415 

Ala Gly Asn Ser 
430 

Met Phe Lys Phe 
445 

Val Thr Lys Gln

Val Val Glu Val 

480 

Lys Arg Pro Ala 
495 

Arg Glu Ser Val 
510 

Asn Tyr Ala Asp 
515




520 


Arg Tyr 


Gln


Asn 


Lys Cys Ser Arg 




530 






535 


Phe 


Pro 


Cys 


Arg 


Gln Cys Glu Arg


545 






550 


Phe 


Thr 


His 


Gly 


Gln Lys Asp Cys








565 


Ser 


Gln


Pro 


Val 


Ser Val Val Lys 








580 




Ile


His 


His 


Ile


Met Gly Lys Val 






595 




600 


Leu 


Val 


Asn 


Val 


Asp Leu Asp Asp 




610 






615 



<210> 190 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutant rep protein: rep52 



<400> 190 



Met 


Glu 


Leu 


Val 


Gly 
5 


Trp 


Leu 


Val 


1 
Gin 


Trp 


Ile


Gln


Glu 


Asp 


Gln


Ala 






20 










Ser 


Asn 


Ser 


Arg 


Ser 


Gln


Ile


Lys 






35 








40 


Ile


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 




50 










55 




Pro 


Val 


Glu 


Asp 


Ile


Ser 


Ser 


Asn 


65 








70 






Asn 


Gly 


Tyr 


Asp 


Pro 


Gln


Tyr 


Ala 










85 








Thr 


Lys 


Lys 


Phe 


Gly 


Lys 


Arg 


Asn 








100 










Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


Ile


Ala 






115 










120 


Phe 


Tyr 


Gly 


Cys 


Val 


Asn 


Trp 


Thr 




130 










135 




Cys 


Val 


Asp 


Lys 


Met 


Val 


Ile


Trp 


145 










150 






Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 








165 








Val 


Asp 


Gln


Lys 


Cys 


Lys 


Ser 


Ser 








180 










Ile


Val 


Thr 


Ser 


Asn 


Thr 


Asn 


Met 






195 










200 


Thr 


Thr 


Phe 


Glu 


His 


Gln


Gln


Pro 




210 










215 




Glu 


Leu 


Thr 


Arg 


Arg 


Leu 


Asp 


His 


225 










230 






Glu 


Val 


Lys 


Asp 


Phe 


Phe 


Arg 


Trp 










245 








Glu 


His 


Glu 


Phe 


Tyr 


Val 


Lys 


Lys 








260 










Pro 


Ser 


Asp 


Ala 


Asp 


Ile


Ser 


Glu 






275 










280 


Ala 


Gln


Pro 


Ser 


Thr 


Ser 


Asp 


Ala 




290 










295 
525



His 


Val 


Gly 


Met 


Asn 


Leu 


Met 


Leu 








540 










Met 


Asn 


Gln


Asn 


Ser 


Asn 


Ile


Cys 






555 










560 


Leu 


Glu 


Cys 


Phe 


Pro 


Val 


Ser 


Glu 




570 








575 




Lys 


Ala 


Tyr 


Gln


Lys 


Leu 


Cys 


Tyr 


585 










590 






Pro Asp Ala 


Cys


Thr 


Ala 


Cys 


Asp 










605 








Cys 


Ile


Phe


Glu 


Gln












620 










429 


GCC 














Asp 


Lys 


Gly 


Ile


Thr 


Ser 


Glu 


Lys


10 










15 




Ser 


Tyr 


Ile


Ser 


Phe 


Asn 


Ala 


Ala 


25 








30 






Ala 


Ala 


Leu 


Asp 


Asn 


Ala 


Gly 


Lys 









45 








Pro Asp 


Tyr 


Leu 


Val 


Gly 



Gln


Gln








60 










Arg 


Ile


Tyr 


Lys 


Ile


Leu 


Glu 


Leu 




75 










80 


Ala 


Ser 


Val 


Phe 


Leu 


Gly 


Trp 


Ala 




90 










95 




Thr 


Ile


Trp 


Leu 


Phe 


Gly 


Pro 


Ala 


105 








110 






Glu 


Ala 


Ile


Ala 


His 


Thr 


Val 


Pro 










125 








Asn 


Glu 


Asn 


Phe 


Pro 


Phe 


Asn 


Asp 








140 










Trp 


Glu 


Glu 


Gly Lys 


Met 


Thr 


Ala 




155 










160 


Ile Leu Gly


Gly Ser 


Lys 


Val 


Arg 




170 










175 




Ala 


Gln


Ile


Asp 


Pro 


Thr 


Pro 


Val 


185 








190 






Cys 


Ala 


Val 


Ile


Ala 


Gly 


Asn 


Ser 








205 








Leu Gln Asp


Arg 


Met 


Phe 


Lys 


Phe 








220 










Asp 


Phe 


Gly 


Lys 


Val 


Thr 


Lys 


Gln




235 










240 


Ala 


Lys 


Asp 


His 


Val 


Val 


Glu 


Val 




250 










255 




Gly Gly Ala 


Lys 


Lys 


Arg 


Pro 


Ala 


265 










270 






Pro 


Lys 


Arg 


Val 


Arg 


Glu 


Ser 


Val 








285 








Glu 


Ala 


Ser 


Ile


Asn 


Tyr 


Ala 


Asp 



300 
Arg


Tyr 


Gln


Asn 


305 








Phe 


Pro 


Cys Arg 


Phe 


Thr 


His 


Gly 








340 


Ser 


Gln


Pro 


Val 






355 




Ile


His 


His 


Ile




370 






Leu 


Val 


Asn 


Val 


385 









Lys Cys Ser Arg 
310 

Gln Cys Glu Arg
325

Gln Lys Asp Cys

Ser Val Val Lys 

360 

Met Gly Lys Val 
375 

Asp Leu Asp Asp 
390 



His Val Gly Met 
315 

Met Asn Gln Asn
330 

Leu Glu Cys Phe 
345 

Lys Ala Tyr Gln

Pro Asp Ala Cys 

380 

Cys Ile Phe Glu
395 



Asn Leu Met Leu 

320 

Ser Asn Ile Cys
335 

Pro Val Ser Glu 
350 

Lys Leu Cys Tyr 
365 

Thr Ala Cys Asp 
Gln



<210> 191 
<211> 536 
<212> PRT 
<213> Artificial Sequence



<220> 

<223> Mutant rep protein: rep68 429 GCC 



<400> 191 



Thr 


Ala 


Gly 


Phe 


1 








Glu 


His 


Leu 


Pro 








20 


Lys 


Glu 


Trp 


Glu 






35 




Glu 


Gln


Ala 


Pro 




50 






Thr 


Glu 


Trp 


Arg 


65 








Gln


Phe 


Glu 


Lys 


Thr 


Thr 


Gly 


Val 






100 


Arg 


Glu 


Lys 


Leu 






115 




Pro 


Asn 


Trp 


Phe




130 






Asn 


Lys 


Val 


Val 


145 








Thr 


Gln


Pro 


Glu 


Ser 


Ala 


Cys 


Leu 








180 


Leu 


Thr 


His 


Val 






195 




Pro 


Asn 


Ser 


Asp 




210 






Met 


Glu 


Leu 


Val 


225 








Gln


Trp 


Ile


Gln


Ser 


Asn 


Ser 


Arg 








260 


Ile


Met 


Ser 


Leu 






275 




Pro 


Val 


Glu 


Asp 




290 






Asn 


Gly 


Tyr 


Asp 



Tyr 


Glu 


Ile


Val 


5 








Gly 


Ile


Ser 


Asp 


Leu 


Pro 


Pro 


Asp 








40 


Leu 


Thr 


Val 


Ala 






55 




Arg 


Val 


Ser 


Lys 




70 






Gly 


Glu 


Ser 


Tyr 


85 








Lys 


Ser 


Met 


Val 


Ile


Gln


Arg


Ile








120


Ala 


Val 


Thr 


Lys 






135 




Asp 


Glu 


Cys 


Tyr 




150 






Leu 


Gln


Trp 


Ala 


165 








Asn 


Leu 


Thr 


Glu 


Ser 


Gln


Thr


Gln








200 


Ala 


Pro 


Val 


Ile






215 




Gly 


Trp 


Leu 


Val 




230 






Glu 


Asp 


Gln


Ala 


245 








Ser 


Gln


Ile


Lys 


Thr 


Lys 


Thr 


Ala 








280 


Ile


Ser 


Ser 


Asn 






295 




Pro 


Gln


Tyr 


Ala 



Ile


Lys 


Val 


Pro 




10 






Ser 


Phe 


Val 


Asn 


25 








Ser Asp 


Met 


Asp 


Glu 


Lys 


Leu 


Gln






60 


Ala 


Pro 


Glu 


Ala 






75 




Phe 


His 


Met 


His 




90 






Leu 


Gly 


Arg 


Phe 


105 








Tyr 


Arg 


Gly Ile


Thr 


Arg 


Asn Gly 








140 


Ile


Pro 


Asn Tyr 






155 




Trp 


Thr 


Asn 


Met 




170 






Arg 


Lys 


Arg 


Leu 


185 








Glu 


Gln


Asn Lys 


Arg 


Ser 


Lys 


Thr 








220 


Asp 


Lys 


Gly Ile






235 




Ser 


Tyr 


Ile


Ser 




250 






Ala 


Ala 


Leu 


Asp 


265 






Pro 


Asp 


Tyr 


Leu 


Arg 


Ile


Tyr 


Lys 








300 


Ala 


Ser 


Val 


Phe 



Ser Asp Leu Asp 
15 

Trp Val Ala Glu 
30 

Leu Asn Leu Ile
45 

Arg Asp Phe Leu 

Leu Phe Phe Val 

80 

Val Leu Val Glu 
95 

Leu Ser Gln Ile
110 

Glu Pro Thr Leu 
125 

Ala Gly Gly Gly 

Leu Leu Pro Lys 

160 

Glu Gln Tyr Leu
175

Val Ala Gln His
190

Glu Asn Gln Asn
205 

Ser Ala Arg Tyr 

Thr Ser Glu Lys 

240 

Phe Asn Ala Ala 
255 

Asn Ala Gly Lys 
270 

Val Gly Gln Gln
285

Ile Leu Glu Leu
Leu Gly Trp Ala 
305










310 






Thr


Lys 


Lys 


Phe 


Gly Lys Arg 


Asn 


Thr 








325 








Thr 


Thr


Gly 


Lys 


Thr 


Asn Ile


Ala 


Glu 






340 








345 


Phe


Tyr 


Gly 


Cys 


Val 


Asn Trp 


Thr 


Asn 




355 






360




Cys 


Val 


Asp 


Lys 


Met 


Val Ile


Trp 


Trp 


370 








375 






Lys 


Val 


Val 


Glu 


Ser Ala Lys Ala Ile


385 










390 






Val 


Asp 


Gln


Lys 


Cys 


Lys Ser 


Ser 


Ala 








405 








Ile


Val 


Thr 


Ser 
420 


Asn 


Thr Asn 


Met 


Cys 
425 


Thr 


Thr 


Phe Glu


His 


Gln Gln


Pro 


Leu 






435 








440 




Glu 


Leu 


Thr Arg 


Arg 


Leu Asp 


His 


Asp 




450 








455 






Glu 


Val 


Lys 


Asp 


Phe 


Phe Arg 


Trp 


Ala 


465 










470 






Glu 


His 


Glu 


Phe 


Tyr 
485 


Val Lys 


Lys 


Gly 


Pro 


Ser 


Asp 


Ala 


Asp 


Ile Ser


Glu 


Pro 






500 








505 


Ala 


Gln


Pro 
515 


Ser 


Thr 


Ser Asp 


Ala 
520 


Glu 


Arg 


Leu 


Ala 


Arg 


Gly 


His Ser 


Leu 






530 








535 
<400> 192



Met 


Glu 


Leu 


Val 


Gly 
5 


Trp 


Leu 


Val 


Asp 


1 
Gin 


Trp 


Ile


Gln


Glu 


Asp 


Gln


Ala 


Ser 






20 










25 


Ser 


Asn 


Ser 


Arg 


Ser 


Gln


Ile


Lys 


Ala 






35 








40 




Ile


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 


Pro 




50 










55 






Pro 


Val 


Glu 


Asp 


Ile


Ser 


Ser 


Asn Arg 


65 










70 








Asn 


Gly 


Tyr 


Asp 


Pro 


Gln


Tyr 


Ala 


Ala 










85 










Thr 


Lys 


Lys 


Phe 


Gly 


Lys 


Arg 


Asn 


Thr 








100 










105 


Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


Ile


Ala 


Glu 






115 








120 




Phe 


Tyr 


Gly 


Cys 


Val 


Asn 


Trp 


Thr 


Asn 




130 










135 






Cys 


Val 


Asp 


Lys 


Met 


Val 


Ile


Trp 


Trp 


145 










150 








Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 


Ile



165 
315






320 


Ile Trp Leu Phe


Gly 


Pro 


Ala 


330 




335 




Ala Ile Ala His


Thr 


Val 


Pro 




350 






Glu Asn Phe Pro 


Phe 


Asn Asp 


365 








Glu Glu Gly Lys 


Met 


Thr 


Ala 


380 








Leu Gly Gly Ser 


Lys 


Val 


Arg 


395 






400 


Gln Ile Asp Pro


Thr 


Pro 


Val 


410 




415 




Ala Val Ile Ala


Gly Asn 


Ser 




430 






Gln Asp Arg Met


Phe 


Lys 


Phe 


445 








Phe Gly Lys Val 


Thr 


Lys 


Gin 


460 








Lys Asp His Val 


Val 


Glu 


Val 


475 






480 


Gly Ala Lys Lys 


Arg 


Pro 


Ala 


490 




495 




Lys Arg Val Arg 


Glu 


Ser 


Val 




510 






Ala Ser Ile Asn


Tyr 


Ala 


Asp 



525 



GCC 



Lys 


Gly 


Ile


Thr 


Ser Glu 


Lys 


10 








15 




Tyr 


Ile


Ser 


Phe 


Asn Ala 


Ala 








30 




Ala 


Leu Asp 


Asn 


Ala Gly 


Lys 








45 






Asp 


Tyr 


Leu 
60 


Val 


Gly Gln


Gln


Ile


Tyr 


Lys 


Ile


Leu Glu 


Leu 




75 






80 


Ser 


Val 


Phe 


Leu 


Gly Trp 


Ala 


90 








95 




Ile


Trp 


Leu 


Phe 


Gly Pro 


Ala 








110 




Ala 


Ile


Ala 


His 
125 


Thr Val 


Pro 


Glu 


Asn 


Phe 
140 


Pro 


Phe Asn 


Asp 


Glu Glu Gly Lys 


Met Thr 


Ala 




155 








160 


Leu Gly Gly 


Ser 


Lys Val 


Arg 


170 








175 





